Overview
Dataset statistics
| Number of variables | 61 |
|---|---|
| Number of observations | 2430 |
| Missing cells | 1875 |
| Missing cells (%) | 1.3% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.1 MiB |
| Average record size in memory | 496.0 B |
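The derived figures in this table can be checked from the raw counts. The sketch below assumes that the missing-cell percentage is missing cells divided by variables × observations, and that total size is average record size × observations expressed in MiB (1024 × 1024 bytes). The variable names are illustrative only.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 61
n_observations = 2430
missing_cells = 1875
avg_record_bytes = 496.0

# Assumption: missing cells (%) = missing cells / (variables * observations).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: total size = average record size * observations, in MiB.
total_mib = avg_record_bytes * n_observations / (1024 * 1024)

print(f"Total cells: {total_cells}")
print(f"Missing cells: {missing_pct:.1f}%")
print(f"Total size: {total_mib:.1f} MiB")
```

Under these assumptions the results round to the 1.3% and 1.1 MiB shown in the table.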
Variable types
| Categorical | 52 |
|---|---|
| Numeric | 4 |
| Text | 3 |
| Boolean | 2 |
Alerts
AJCC ID (2018+) is highly overall correlated with Derived EOD 2018 M Recode (2018+) and 6 other fields | High correlation |
COD to site rec KM is highly overall correlated with COD to site recode and 5 other fields | High correlation |
COD to site recode is highly overall correlated with COD to site rec KM and 5 other fields | High correlation |
COD to site recode ICD-O-3 2023 Revision is highly overall correlated with COD to site rec KM and 5 other fields | High correlation |
COD to site recode ICD-O-3 2023 Revision Expanded (1999+) is highly overall correlated with COD to site rec KM and 5 other fields | High correlation |
Chemotherapy recode (yes, no/unk) is highly overall correlated with Derived EOD 2018 Stage Group Recode (2018+) and 3 other fields | High correlation |
Derived EOD 2018 M Recode (2018+) is highly overall correlated with AJCC ID (2018+) and 8 other fields | High correlation |
Derived EOD 2018 N Recode (2018+) is highly overall correlated with AJCC ID (2018+) and 7 other fields | High correlation |
Derived EOD 2018 Stage Group Recode (2018+) is highly overall correlated with AJCC ID (2018+) and 5 other fields | High correlation |
Derived EOD 2018 T Recode (2018+) is highly overall correlated with AJCC ID (2018+) and 6 other fields | High correlation |
Derived Summary Grade 2018 (2018+) is highly overall correlated with Grade Clinical (2018+) and 1 other field | High correlation |
EOD Mets Recode (2018+) is highly overall correlated with Derived EOD 2018 M Recode (2018+) and 2 other fields | High correlation |
EOD Primary Tumor Recode (2018+) is highly overall correlated with Derived EOD 2018 T Recode (2018+) | High correlation |
EOD Regional Nodes Recode (2018+) is highly overall correlated with Derived EOD 2018 N Recode (2018+) | High correlation |
First malignant primary indicator is highly overall correlated with Record number recode and 2 other fields | High correlation |
Grade Clinical (2018+) is highly overall correlated with Derived Summary Grade 2018 (2018+) | High correlation |
Grade Pathological (2018+) is highly overall correlated with Derived Summary Grade 2018 (2018+) | High correlation |
Median household income inflation adj to 2023 is highly overall correlated with Patient ID | High correlation |
Mets at DX-Distant LN (2016+) is highly overall correlated with Mets at DX-Other (2016+) and 4 other fields | High correlation |
Mets at DX-Other (2016+) is highly overall correlated with Mets at DX-Distant LN (2016+) and 4 other fields | High correlation |
PRCDA 2020 is highly overall correlated with Patient ID | High correlation |
Patient ID is highly overall correlated with Median household income inflation adj to 2023 and 1 other field | High correlation |
Primary Site is highly overall correlated with AJCC ID (2018+) and 4 other fields | High correlation |
Primary Site - labeled is highly overall correlated with AJCC ID (2018+) and 4 other fields | High correlation |
RX Summ--Scope Reg LN Sur (2003+) is highly overall correlated with Reason no cancer-directed surgery and 1 other field | High correlation |
RX Summ--Surg Prim Site (1998+) is highly overall correlated with RX Summ--Surg/Rad Seq and 1 other field | High correlation |
RX Summ--Surg/Rad Seq is highly overall correlated with RX Summ--Surg Prim Site (1998+) | High correlation |
RX Summ--Systemic/Sur Seq (2007+) is highly overall correlated with Chemotherapy recode (yes, no/unk) | High correlation |
Reason no cancer-directed surgery is highly overall correlated with RX Summ--Scope Reg LN Sur (2003+) and 1 other field | High correlation |
Record number recode is highly overall correlated with First malignant primary indicator and 2 other fields | High correlation |
Regional nodes examined (1988+) is highly overall correlated with RX Summ--Scope Reg LN Sur (2003+) and 1 other field | High correlation |
Regional nodes positive (1988+) is highly overall correlated with Regional nodes examined (1988+) | High correlation |
SEER Combined Mets at DX-bone (2010+) is highly overall correlated with Mets at DX-Distant LN (2016+) and 4 other fields | High correlation |
SEER Combined Mets at DX-brain (2010+) is highly overall correlated with Mets at DX-Distant LN (2016+) and 4 other fields | High correlation |
SEER Combined Mets at DX-liver (2010+) is highly overall correlated with Derived EOD 2018 M Recode (2018+) and 6 other fields | High correlation |
SEER Combined Mets at DX-lung (2010+) is highly overall correlated with Mets at DX-Distant LN (2016+) and 4 other fields | High correlation |
SEER cause-specific death classification is highly overall correlated with COD to site rec KM and 5 other fields | High correlation |
SEER other cause of death classification is highly overall correlated with COD to site rec KM and 5 other fields | High correlation |
Sequence number is highly overall correlated with First malignant primary indicator and 2 other fields | High correlation |
Site recode ICD-O-3 2023 Revision Expanded is highly overall correlated with AJCC ID (2018+) and 4 other fields | High correlation |
Survival months flag is highly overall correlated with Type of Reporting Source | High correlation |
Total number of in situ/malignant tumors for patient is highly overall correlated with First malignant primary indicator and 2 other fields | High correlation |
Tumor Size Summary (2016+) is highly overall correlated with Chemotherapy recode (yes, no/unk) and 1 other field | High correlation |
Type of Reporting Source is highly overall correlated with Survival months flag | High correlation |
Vital status recode (study cutoff used) is highly overall correlated with COD to site rec KM and 6 other fields | High correlation |
Year of follow-up recode is highly overall correlated with Vital status recode (study cutoff used) | High correlation |
Site recode ICD-O-3 2023 Revision Expanded is highly imbalanced (62.7%) | Imbalance |
Grade Clinical (2018+) is highly imbalanced (70.8%) | Imbalance |
Diagnostic Confirmation is highly imbalanced (87.9%) | Imbalance |
Derived EOD 2018 N Recode (2018+) is highly imbalanced (72.9%) | Imbalance |
Derived EOD 2018 M Recode (2018+) is highly imbalanced (55.2%) | Imbalance |
RX Summ--Surg Oth Reg/Dis (2003+) is highly imbalanced (84.5%) | Imbalance |
RX Summ--Surg/Rad Seq is highly imbalanced (98.8%) | Imbalance |
Reason no cancer-directed surgery is highly imbalanced (60.5%) | Imbalance |
Radiation recode is highly imbalanced (96.8%) | Imbalance |
RX Summ--Systemic/Sur Seq (2007+) is highly imbalanced (59.2%) | Imbalance |
EOD Primary Tumor Recode (2018+) is highly imbalanced (51.8%) | Imbalance |
EOD Regional Nodes Recode (2018+) is highly imbalanced (68.2%) | Imbalance |
EOD Mets Recode (2018+) is highly imbalanced (66.3%) | Imbalance |
Regional nodes examined (1988+) is highly imbalanced (67.5%) | Imbalance |
Regional nodes positive (1988+) is highly imbalanced (67.9%) | Imbalance |
SEER Combined Mets at DX-bone (2010+) is highly imbalanced (92.2%) | Imbalance |
SEER Combined Mets at DX-brain (2010+) is highly imbalanced (93.0%) | Imbalance |
SEER Combined Mets at DX-liver (2010+) is highly imbalanced (71.1%) | Imbalance |
SEER Combined Mets at DX-lung (2010+) is highly imbalanced (92.2%) | Imbalance |
Mets at DX-Distant LN (2016+) is highly imbalanced (91.7%) | Imbalance |
Mets at DX-Other (2016+) is highly imbalanced (78.5%) | Imbalance |
COD to site recode is highly imbalanced (82.1%) | Imbalance |
SEER cause-specific death classification is highly imbalanced (79.7%) | Imbalance |
SEER other cause of death classification is highly imbalanced (78.8%) | Imbalance |
Survival months flag is highly imbalanced (92.6%) | Imbalance |
COD to site rec KM is highly imbalanced (82.1%) | Imbalance |
COD to site recode ICD-O-3 2023 Revision is highly imbalanced (81.9%) | Imbalance |
COD to site recode ICD-O-3 2023 Revision Expanded (1999+) is highly imbalanced (82.0%) | Imbalance |
Vital status recode (study cutoff used) is highly imbalanced (50.0%) | Imbalance |
Sequence number is highly imbalanced (53.0%) | Imbalance |
Primary by international rules is highly imbalanced (96.5%) | Imbalance |
Record number recode is highly imbalanced (58.2%) | Imbalance |
Total number of in situ/malignant tumors for patient is highly imbalanced (53.9%) | Imbalance |
Total number of benign/borderline tumors for patient is highly imbalanced (93.7%) | Imbalance |
Year of follow-up recode is highly imbalanced (74.4%) | Imbalance |
Type of Reporting Source is highly imbalanced (80.7%) | Imbalance |
RX Summ--Scope Reg LN Sur (2003+) has 1875 (77.2%) missing values | Missing |
Reproduction
| Analysis started | 2025-07-24 19:16:54.658752 |
|---|---|
| Analysis finished | 2025-07-24 19:17:12.486206 |
| Duration | 17.83 seconds |
| Software version | ydata-profiling v4.16.1 |
| Download configuration | config.json |
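The reported duration can be reproduced from the two timestamps. This is a minimal sketch using only the standard library. The format string is an assumption about how the timestamps are laid out.

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" table.
FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2025-07-24 19:16:54.658752", FMT)
finished = datetime.strptime("2025-07-24 19:17:12.486206", FMT)

# Elapsed wall-clock time in seconds.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```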
Variables
Race recode (White, Black, Other)
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
| White | 1472 |
|---|---|
| Black | 465 |
| Other (American Indian/AK Native, Asian/Pacific Islander) | 443 |
| Unknown | 50 |
Length
| Max length | 57 |
|---|---|
| Median length | 5 |
| Mean length | 14.520988 |
| Min length | 5 |
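The length statistics follow from the category labels and their counts. The sketch below assumes length means the number of characters in each label and takes the counts from this variable's Common Values table. The names `value_counts` and `length_stats` are illustrative.

```python
from statistics import median

# Category counts copied from this variable's "Common Values" table.
value_counts = {
    "White": 1472,
    "Black": 465,
    "Other (American Indian/AK Native, Asian/Pacific Islander)": 443,
    "Unknown": 50,
}

# One string length per observation.
lengths = [len(value) for value, count in value_counts.items() for _ in range(count)]

length_stats = {
    "max": max(lengths),
    "median": median(lengths),
    "mean": sum(lengths) / len(lengths),
    "min": min(lengths),
}
print(length_stats)
```

The mean is the total character count divided by the 2430 observations.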
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | White |
|---|---|
| 2nd row | Black |
| 3rd row | Black |
| 4th row | White |
| 5th row | White |
Common Values
| Value | Count | Frequency (%) |
| White | 1472 | 60.6% |
| Black | 465 | 19.1% |
| Other (American Indian/AK Native, Asian/Pacific Islander) | 443 | 18.2% |
| Unknown | 50 | 2.1% |
Words
| Value | Count | Frequency (%) |
| white | 1472 | 31.7% |
| black | 465 | 10.0% |
| other | 443 | 9.5% |
| american | 443 | 9.5% |
| indian/ak | 443 | 9.5% |
| native | 443 | 9.5% |
| asian/pacific | 443 | 9.5% |
| islander | 443 | 9.5% |
| unknown | 50 | 1.1% |
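The word counts in this table are consistent with lowercasing each label, splitting on whitespace, and trimming punctuation from token edges. The sketch below reproduces the table under that assumption. The tokenization rule is inferred from the table itself, not taken from the profiling tool.

```python
import string
from collections import Counter

# Category counts copied from this variable's "Common Values" table.
value_counts = {
    "White": 1472,
    "Black": 465,
    "Other (American Indian/AK Native, Asian/Pacific Islander)": 443,
    "Unknown": 50,
}

word_counts = Counter()
for value, count in value_counts.items():
    for token in value.lower().split():
        # Assumption: punctuation is trimmed only at token edges,
        # so "indian/ak" keeps its inner slash.
        word = token.strip(string.punctuation)
        if word:
            word_counts[word] += count

total_words = sum(word_counts.values())
for word, count in word_counts.most_common():
    print(f"{word}: {count} ({100 * count / total_words:.1f}%)")
```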
Most occurring characters
| Value | Count | Frequency (%) |
| i | 4130 | 11.7% |
| e | 3244 | 9.2% |
| a | 3123 | 8.9% |
| n | 2365 | 6.7% |
| t | 2358 | 6.7% |
| (space) | 2215 | 6.3% |
| h | 1915 | 5.4% |
| c | 1794 | 5.1% |
| W | 1472 | 4.2% |
| A | 1329 | 3.8% |
| Other values (21) | 11341 | 32.1% |
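The character table can be rebuilt by counting every character of every label, weighted by how often the label occurs. This is a minimal sketch assuming case-sensitive counting with spaces included. `value_counts` is again copied from the Common Values table.

```python
from collections import Counter

# Category counts copied from this variable's "Common Values" table.
value_counts = {
    "White": 1472,
    "Black": 465,
    "Other (American Indian/AK Native, Asian/Pacific Islander)": 443,
    "Unknown": 50,
}

char_counts = Counter()
for value, count in value_counts.items():
    for ch in value:
        # Case-sensitive: "W" and "w" are counted separately.
        char_counts[ch] += count

total_chars = sum(char_counts.values())
for ch, count in char_counts.most_common(10):
    print(f"{ch!r}: {count} ({100 * count / total_chars:.1f}%)")
```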
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.0411523 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Female |
|---|---|
| 2nd row | Female |
| 3rd row | Female |
| 4th row | Female |
| 5th row | Female |
Common Values
| Value | Count | Frequency (%) |
| Female | 1265 | 52.1% |
| Male | 1165 | 47.9% |
Words
| Value | Count | Frequency (%) |
| female | 1265 | 52.1% |
| male | 1165 | 47.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 3695 | 30.2% |
| a | 2430 | 19.8% |
| l | 2430 | 19.8% |
| F | 1265 | 10.3% |
| m | 1265 | 10.3% |
| M | 1165 | 9.5% |
Year of diagnosis
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
| 2022 | 782 |
|---|---|
| 2021 | 745 |
| 2019 | 331 |
| 2020 | 312 |
| 2018 | 260 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2022 |
|---|---|
| 2nd row | 2020 |
| 3rd row | 2018 |
| 4th row | 2019 |
| 5th row | 2021 |
Common Values
| Value | Count | Frequency (%) |
| 2022 | 782 | 32.2% |
| 2021 | 745 | 30.7% |
| 2019 | 331 | 13.6% |
| 2020 | 312 | 12.8% |
| 2018 | 260 | 10.7% |
Words
| Value | Count | Frequency (%) |
| 2022 | 782 | 32.2% |
| 2021 | 745 | 30.7% |
| 2019 | 331 | 13.6% |
| 2020 | 312 | 12.8% |
| 2018 | 260 | 10.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 5051 | 52.0% |
| 0 | 2742 | 28.2% |
| 1 | 1336 | 13.7% |
| 9 | 331 | 3.4% |
| 8 | 260 | 2.7% |
PRCDA 2020
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
| Not PRCDA | 1363 |
|---|---|
| PRCDA | 1067 |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 7.2436214 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not PRCDA |
|---|---|
| 2nd row | Not PRCDA |
| 3rd row | Not PRCDA |
| 4th row | Not PRCDA |
| 5th row | Not PRCDA |
Common Values
| Value | Count | Frequency (%) |
| Not PRCDA | 1363 | 56.1% |
| PRCDA | 1067 | 43.9% |
Words
| Value | Count | Frequency (%) |
| prcda | 2430 | 64.1% |
| not | 1363 | 35.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| P | 2430 | 13.8% |
| A | 2430 | 13.8% |
| D | 2430 | 13.8% |
| C | 2430 | 13.8% |
| R | 2430 | 13.8% |
| N | 1363 | 7.7% |
| (space) | 1363 | 7.7% |
| o | 1363 | 7.7% |
| t | 1363 | 7.7% |
Site recode ICD-O-3 2023 Revision Expanded
Categorical
High correlation  Imbalance 
| Distinct | 14 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
| Stomach | 1651 |
|---|---|
| Small Intestine | 535 |
| Colon And Rectum (Excluding Appendix) | 98 |
| Digestive Other | 76 |
| Retroperitoneum And Peritoneum | 29 |
| Other values (9) | 41 |
Length
| Max length | 37 |
|---|---|
| Median length | 7 |
| Mean length | 10.618519 |
| Min length | 5 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | Stomach |
|---|---|
| 2nd row | Small Intestine |
| 3rd row | Small Intestine |
| 4th row | Stomach |
| 5th row | Stomach |
Common Values
| Value | Count | Frequency (%) |
| Stomach | 1651 | 67.9% |
| Small Intestine | 535 | 22.0% |
| Colon And Rectum (Excluding Appendix) | 98 | 4.0% |
| Digestive Other | 76 | 3.1% |
| Retroperitoneum And Peritoneum | 29 | 1.2% |
| Esophagus | 13 | 0.5% |
| Miscellaneous Neoplasms | 12 | 0.5% |
| Appendix | 5 | 0.2% |
| Soft Tissue | 5 | 0.2% |
| Pancreas | 2 | 0.1% |
| Other values (4) | 4 | 0.2% |
Words
| Value | Count | Frequency (%) |
| stomach | 1651 | 46.9% |
| small | 535 | 15.2% |
| intestine | 535 | 15.2% |
| and | 130 | 3.7% |
| appendix | 103 | 2.9% |
| colon | 98 | 2.8% |
| rectum | 98 | 2.8% |
| excluding | 98 | 2.8% |
| digestive | 76 | 2.2% |
| other | 76 | 2.2% |
| Other values (18) | 117 | 3.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 3066 | 11.9% |
| m | 2356 | 9.1% |
| a | 2233 | 8.7% |
| S | 2191 | 8.5% |
| o | 1978 | 7.7% |
| c | 1863 | 7.2% |
| h | 1741 | 6.7% |
| e | 1692 | 6.6% |
| n | 1578 | 6.1% |
| l | 1305 | 5.1% |
| Other values (28) | 5800 | 22.5% |
Primary Site - labeled
Categorical
High correlation 
| Distinct | 45 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
| C16.9-Stomach, NOS | 478 |
|---|---|
| C16.1-Fundus of stomach | 277 |
| C16.6-Greater curvature of stomach NOS | 238 |
| C16.2-Body of stomach | 206 |
| C17.9-Small intestine, NOS | 186 |
| Other values (40) | 1045 |
Length
| Max length | 56 |
|---|---|
| Median length | 44 |
| Mean length | 23.500823 |
| Min length | 11 |
Unique
| Unique | 8 |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | C16.3-Gastric antrum |
|---|---|
| 2nd row | C17.1-Jejunum |
| 3rd row | C17.9-Small intestine, NOS |
| 4th row | C16.6-Greater curvature of stomach NOS |
| 5th row | C16.1-Fundus of stomach |
Common Values
| Value | Count | Frequency (%) |
| C16.9-Stomach, NOS | 478 | 19.7% |
| C16.1-Fundus of stomach | 277 | 11.4% |
| C16.6-Greater curvature of stomach NOS | 238 | 9.8% |
| C16.2-Body of stomach | 206 | 8.5% |
| C17.9-Small intestine, NOS | 186 | 7.7% |
| C16.5-Lesser curvature of stomach NOS | 185 | 7.6% |
| C17.1-Jejunum | 157 | 6.5% |
| C17.0-Duodenum | 140 | 5.8% |
| C16.3-Gastric antrum | 111 | 4.6% |
| C16.0-Cardia, NOS | 103 | 4.2% |
| Other values (35) | 349 | 14.4% |
Words
| Value | Count | Frequency (%) |
| nos | 1327 | 19.6% |
| of | 1007 | 14.9% |
| stomach | 956 | 14.1% |
| c16.9-stomach | 478 | 7.1% |
| curvature | 423 | 6.3% |
| c16.1-fundus | 277 | 4.1% |
| c16.6-greater | 238 | 3.5% |
| c16.2-body | 206 | 3.0% |
| intestine | 197 | 2.9% |
| c17.9-small | 186 | 2.8% |
| Other values (67) | 1464 | 21.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 4329 | 7.6% |
| t | 3246 | 5.7% |
| o | 3122 | 5.5% |
| a | 3072 | 5.4% |
| 1 | 2710 | 4.7% |
| C | 2549 | 4.5% |
| e | 2491 | 4.4% |
| - | 2430 | 4.3% |
| . | 2430 | 4.3% |
| u | 2280 | 4.0% |
| Other values (49) | 28448 | 49.8% |
Primary Site
Real number (ℝ)
High correlation 
| Distinct | 45 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 179.22346 |
| Minimum | 154 |
|---|---|
| Maximum | 809 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 38.0 KiB |
Quantile statistics
| Minimum | 154 |
|---|---|
| 5-th percentile | 161 |
| Q1 | 163 |
| median | 169 |
| Q3 | 171 |
| 95-th percentile | 269 |
| Maximum | 809 |
| Range | 655 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 60.416237 |
|---|---|
| Coefficient of variation (CV) | 0.33710006 |
| Kurtosis | 60.871169 |
| Mean | 179.22346 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 7.2221934 |
| Sum | 435513 |
| Variance | 3650.1217 |
| Monotonicity | Not monotonic |
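Several of the statistics above are simple functions of one another, which allows a quick consistency check. The sketch below assumes the usual definitions (CV = standard deviation / mean, variance = standard deviation squared, IQR = Q3 − Q1, sum = mean × observations) and uses the rounded values printed in the report, so agreement is only approximate.

```python
import math

# Values copied from the quantile and descriptive statistics tables.
n_observations = 2430
mean = 179.22346
std = 60.416237
q1, q3 = 163, 171
minimum, maximum = 154, 809

cv = std / mean                 # coefficient of variation
variance = std ** 2             # variance from the standard deviation
iqr = q3 - q1                   # interquartile range
value_range = maximum - minimum
total = mean * n_observations   # sum of all values

print(f"CV={cv:.8f} variance={variance:.4f} IQR={iqr} "
      f"range={value_range} sum={total:.0f}")
```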
Common Values
| Value | Count | Frequency (%) |
| 169 | 478 | 19.7% |
| 161 | 277 | 11.4% |
| 166 | 238 | 9.8% |
| 162 | 206 | 8.5% |
| 179 | 186 | 7.7% |
| 165 | 185 | 7.6% |
| 171 | 157 | 6.5% |
| 170 | 140 | 5.8% |
| 163 | 111 | 4.6% |
| 160 | 103 | 4.2% |
| Other values (35) | 349 | 14.4% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 154 | 3 | 0.1% |
| 155 | 9 | 0.4% |
| 159 | 1 | < 0.1% |
| 160 | 103 | 4.2% |
| 161 | 277 | 11.4% |
| 162 | 206 | 8.5% |
| 163 | 111 | 4.6% |
| 164 | 3 | 0.1% |
| 165 | 185 | 7.6% |
| 166 | 238 | 9.8% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 809 | 9 | 0.4% |
| 763 | 1 | < 0.1% |
| 762 | 2 | 0.1% |
| 495 | 4 | 0.2% |
| 494 | 1 | < 0.1% |
| 488 | 2 | 0.1% |
| 482 | 7 | 0.3% |
| 481 | 14 | 0.6% |
| 480 | 6 | 0.2% |
| 382 | 1 | < 0.1% |
Derived Summary Grade 2018 (2018+)
Categorical
High correlation 
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
| L | 1245 |
|---|---|
| 9 | 855 |
| H | 295 |
| A | 24 |
| C | 5 |
| Other values (2) | 6 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | L |
|---|---|
| 2nd row | L |
| 3rd row | 9 |
| 4th row | L |
| 5th row | 9 |
Common Values
| Value | Count | Frequency (%) |
| L | 1245 | 51.2% |
| 9 | 855 | 35.2% |
| H | 295 | 12.1% |
| A | 24 | 1.0% |
| C | 5 | 0.2% |
| B | 4 | 0.2% |
| D | 2 | 0.1% |
Words
| Value | Count | Frequency (%) |
| l | 1245 | 51.2% |
| 9 | 855 | 35.2% |
| h | 295 | 12.1% |
| a | 24 | 1.0% |
| c | 5 | 0.2% |
| b | 4 | 0.2% |
| d | 2 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| L | 1245 | 51.2% |
| 9 | 855 | 35.2% |
| H | 295 | 12.1% |
| A | 24 | 1.0% |
| C | 5 | 0.2% |
| B | 4 | 0.2% |
| D | 2 | 0.1% |
Grade Clinical (2018+)
Categorical
High correlation  Imbalance 
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
| 9 | 2007 |
|---|---|
| L | 341 |
| H | 69 |
| A | 6 |
| C | 4 |
| Other values (2) | 3 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 9 |
|---|---|
| 2nd row | 9 |
| 3rd row | 9 |
| 4th row | 9 |
| 5th row | 9 |
Common Values
| Value | Count | Frequency (%) |
| 9 | 2007 | 82.6% |
| L | 341 | 14.0% |
| H | 69 | 2.8% |
| A | 6 | 0.2% |
| C | 4 | 0.2% |
| D | 2 | 0.1% |
| B | 1 | < 0.1% |
Words
| Value | Count | Frequency (%) |
| 9 | 2007 | 82.6% |
| l | 341 | 14.0% |
| h | 69 | 2.8% |
| a | 6 | 0.2% |
| c | 4 | 0.2% |
| d | 2 | 0.1% |
| b | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 9 | 2007 | 82.6% |
| L | 341 | 14.0% |
| H | 69 | 2.8% |
| A | 6 | 0.2% |
| C | 4 | 0.2% |
| D | 2 | 0.1% |
| B | 1 | < 0.1% |
Grade Pathological (2018+)
Categorical
High correlation 
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
| 9 | 1089 |
|---|---|
| L | 1073 |
| H | 243 |
| A | 19 |
| B | 4 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | L |
|---|---|
| 2nd row | L |
| 3rd row | 9 |
| 4th row | L |
| 5th row | 9 |
Common Values
| Value | Count | Frequency (%) |
| 9 | 1089 | 44.8% |
| L | 1073 | 44.2% |
| H | 243 | 10.0% |
| A | 19 | 0.8% |
| B | 4 | 0.2% |
| C | 2 | 0.1% |
Words
| Value | Count | Frequency (%) |
| 9 | 1089 | 44.8% |
| l | 1073 | 44.2% |
| h | 243 | 10.0% |
| a | 19 | 0.8% |
| b | 4 | 0.2% |
| c | 2 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 9 | 1089 | 44.8% |
| L | 1073 | 44.2% |
| H | 243 | 10.0% |
| A | 19 | 0.8% |
| B | 4 | 0.2% |
| C | 2 | 0.1% |
Diagnostic Confirmation
Categorical
Imbalance 
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
| Positive histology | 2323 |
|---|---|
| Positive exfoliative cytology, no positive histology | 81 |
| Radiography without microscopic confirm | 15 |
| Unknown | 5 |
| Clinical diagnosis only | 4 |
Length
| Max length | 53 |
|---|---|
| Median length | 18 |
| Mean length | 19.277366 |
| Min length | 7 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Positive histology |
|---|---|
| 2nd row | Positive histology |
| 3rd row | Positive histology |
| 4th row | Positive histology |
| 5th row | Positive histology |
Common Values
| Value | Count | Frequency (%) |
| Positive histology | 2323 | 95.6% |
| Positive exfoliative cytology, no positive histology | 81 | 3.3% |
| Radiography without microscopic confirm | 15 | 0.6% |
| Unknown | 5 | 0.2% |
| Clinical diagnosis only | 4 | 0.2% |
| Direct visualization without microscopic confirmation | 2 | 0.1% |
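The Imbalance alert reports 87.9% for this variable. That figure is consistent with an imbalance score defined as one minus the normalized Shannon entropy of the category counts. The sketch below rests on that assumption: the formula is inferred from the numbers in this report and is not confirmed by it, and `imbalance_score` is a hypothetical helper name.

```python
import math

# Category counts copied from the "Common Values" table above.
counts = [2323, 81, 15, 5, 4, 2]


def imbalance_score(counts):
    """Return 1 - normalized Shannon entropy (0 means perfectly balanced)."""
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2:
        # Normalization is undefined for fewer than two classes;
        # this sketch returns 0.0 in that case.
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(n_classes)


score = imbalance_score(counts)
print(f"{100 * score:.1f}%")
```

A score near 100% means one category dominates. Here "Positive histology" accounts for 2323 of the 2430 records.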
Words
| Value | Count | Frequency (%) |
| positive | 2485 | 47.6% |
| histology | 2404 | 46.1% |
| exfoliative | 81 | 1.6% |
| cytology | 81 | 1.6% |
| no | 81 | 1.6% |
| without | 17 | 0.3% |
| microscopic | 17 | 0.3% |
| radiography | 15 | 0.3% |
| confirm | 15 | 0.3% |
| unknown | 5 | 0.1% |
| Other values (6) | 18 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 7717 | 16.5% |
| i | 7645 | 16.3% |
| t | 5091 | 10.9% |
| s | 4916 | 10.5% |
| (space) | 2789 | 6.0% |
| e | 2649 | 5.7% |
| y | 2585 | 5.5% |
| l | 2580 | 5.5% |
| v | 2568 | 5.5% |
| g | 2504 | 5.3% |
| Other values (20) | 5800 | 12.4% |
AJCC ID (2018+)
Categorical
High correlation 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
Length
| Max length | 74 |
|---|---|
| Median length | 25 |
| Mean length | 38.235802 |
| Min length | 15 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | GIST: Gastric and Omental |
|---|---|
| 2nd row | GIST: Small Intestinal, Esophageal, Colorectal, Mesenteric, and Peritoneal |
| 3rd row | GIST: Small Intestinal, Esophageal, Colorectal, Mesenteric, and Peritoneal |
| 4th row | GIST: Gastric and Omental |
| 5th row | GIST: Gastric and Omental |
Common Values
| Value | Count | Frequency (%) |
| GIST: Gastric and Omental | 1652 | 68.0% |
| GIST: Small Intestinal, Esophageal, Colorectal, Mesenteric, and Peritoneal | 677 | 27.9% |
| No AJCC Chapter | 101 | 4.2% |
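The Frequency (%) column in the Common Values tables appears to be each count divided by the 2430 observations, rounded to one decimal place. A minimal Python sketch of that arithmetic follows; the counts are copied from the table above, while the helper name `frequency_percent` is hypothetical and not part of the profiling tool.

```python
# Counts copied from the Common Values table for AJCC ID (2018+).
counts = {
    "GIST: Gastric and Omental": 1652,
    "GIST: Small Intestinal, Esophageal, Colorectal, Mesenteric, and Peritoneal": 677,
    "No AJCC Chapter": 101,
}


def frequency_percent(counts):
    """Return each count as a percentage of the total, rounded to one decimal.

    Hypothetical helper for illustration; assumes the column has no missing
    values, so the counts sum to the number of observations.
    """
    total = sum(counts.values())
    return {value: round(100 * n / total, 1) for value, n in counts.items()}


freq = frequency_percent(counts)
for value, pct in freq.items():
    print(f"{value}: {pct}%")
```

The 4.2% reported for "No AJCC Chapter" matches 101 / 2430, which suggests the same rule applies to the rows whose percentage is not printed.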
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| gist | 2329 | |
| and | 2329 | |
| gastric | 1652 | |
| omental | 1652 | |
| small | 677 | 5.5% |
| intestinal | 677 | 5.5% |
| esophageal | 677 | 5.5% |
| colorectal | 677 | 5.5% |
| mesenteric | 677 | 5.5% |
| peritoneal | 677 | 5.5% |
| Other values (3) | 303 | 2.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 9897 | 10.7% |
| a | 9796 | 10.5% |
| e | 7169 | 7.7% |
| t | 6790 | 7.3% |
| n | 6689 | 7.2% |
| l | 6391 | 6.9% |
| G | 3981 | 4.3% |
| r | 3784 | 4.1% |
| s | 3683 | 4.0% |
| i | 3683 | 4.0% |
| Other values (20) | 31050 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 92913 |
Derived EOD 2018 T Recode (2018+)
Categorical
High correlation 
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | T2 |
|---|---|
| 2nd row | T2 |
| 3rd row | T2 |
| 4th row | T2 |
| 5th row | T2 |
Common Values
| Value | Count | Frequency (%) |
| T2 | 807 | 33.2% |
| T1 | 507 | 20.9% |
| T3 | 487 | 20.0% |
| T4 | 297 | 12.2% |
| TX | 228 | 9.4% |
| 88 | 101 | 4.2% |
| T0 | 3 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| t2 | 807 | |
| t1 | 507 | |
| t3 | 487 | |
| t4 | 297 | 12.2% |
| tx | 228 | 9.4% |
| 88 | 101 | 4.2% |
| t0 | 3 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| T | 2329 | |
| 2 | 807 | 16.6% |
| 1 | 507 | 10.4% |
| 3 | 487 | 10.0% |
| 4 | 297 | 6.1% |
| X | 228 | 4.7% |
| 8 | 202 | 4.2% |
| 0 | 3 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4860 |
Derived EOD 2018 N Recode (2018+)
Categorical
High correlation  Imbalance 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | N0 |
|---|---|
| 2nd row | N0 |
| 3rd row | N0 |
| 4th row | N0 |
| 5th row | N0 |
Common Values
| Value | Count | Frequency (%) |
| N0 | 2262 | 93.1% |
| 88 | 101 | 4.2% |
| N1 | 67 | 2.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| n0 | 2262 | |
| 88 | 101 | 4.2% |
| n1 | 67 | 2.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 2329 | |
| 0 | 2262 | |
| 8 | 202 | 4.2% |
| 1 | 67 | 1.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4860 |
Derived EOD 2018 M Recode (2018+)
Categorical
High correlation  Imbalance 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | M0 |
|---|---|
| 2nd row | M0 |
| 3rd row | M0 |
| 4th row | M0 |
| 5th row | M0 |
Common Values
| Value | Count | Frequency (%) |
| M0 | 2088 | 85.9% |
| M1 | 241 | 9.9% |
| 88 | 101 | 4.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| m0 | 2088 | |
| m1 | 241 | 9.9% |
| 88 | 101 | 4.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 2329 | |
| 0 | 2088 | |
| 1 | 241 | 5.0% |
| 8 | 202 | 4.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4860 |
Derived EOD 2018 Stage Group Recode (2018+)
Categorical
High correlation 
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 1.7234568 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1A |
|---|---|
| 2nd row | 1 |
| 3rd row | 99 |
| 4th row | 1A |
| 5th row | 99 |
Common Values
| Value | Count | Frequency (%) |
| 99 | 678 | 27.9% |
| 1A | 656 | 27.0% |
| 4 | 282 | 11.6% |
| 1 | 203 | 8.4% |
| 2 | 176 | 7.2% |
| 1B | 136 | 5.6% |
| 3B | 114 | 4.7% |
| 88 | 101 | 4.2% |
| 3A | 73 | 3.0% |
| 3 | 11 | 0.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 99 | 678 | |
| 1a | 656 | |
| 4 | 282 | |
| 1 | 203 | 8.4% |
| 2 | 176 | 7.2% |
| 1b | 136 | 5.6% |
| 3b | 114 | 4.7% |
| 88 | 101 | 4.2% |
| 3a | 73 | 3.0% |
| 3 | 11 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 9 | 1356 | |
| 1 | 995 | |
| A | 729 | |
| 4 | 282 | 6.7% |
| B | 250 | 6.0% |
| 8 | 202 | 4.8% |
| 3 | 198 | 4.7% |
| 2 | 176 | 4.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4188 |
RX Summ--Surg Prim Site (1998+)
Categorical
High correlation 
| Distinct | 28 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 5 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | 30 |
|---|---|
| 2nd row | 30 |
| 3rd row | 30 |
| 4th row | 30 |
| 5th row | 30 |
Common Values
| Value | Count | Frequency (%) |
| 30 | 1063 | 43.7% |
| 00 | 635 | 26.1% |
| 27 | 141 | 5.8% |
| 33 | 116 | 4.8% |
| 32 | 83 | 3.4% |
| 20 | 69 | 2.8% |
| 51 | 53 | 2.2% |
| 60 | 51 | 2.1% |
| 61 | 47 | 1.9% |
| 40 | 41 | 1.7% |
| Other values (18) | 131 | 5.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2526 | |
| 3 | 1407 | |
| 2 | 332 | 6.8% |
| 7 | 142 | 2.9% |
| 1 | 131 | 2.7% |
| 6 | 114 | 2.3% |
| 5 | 79 | 1.6% |
| 4 | 60 | 1.2% |
| 9 | 54 | 1.1% |
| 8 | 15 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4860 |
RX Summ--Scope Reg LN Sur (2003+)
Categorical
High correlation  Missing 
| Distinct | 6 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 1875 |
| Missing (%) | 77.2% |
| Memory size | 38.0 KiB |
Length
| Max length | 48 |
|---|---|
| Median length | 46 |
| Mean length | 35.636036 |
| Min length | 25 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | 4 or more regional lymph nodes removed |
|---|---|
| 2nd row | 1 to 3 regional lymph nodes removed |
| 3rd row | 4 or more regional lymph nodes removed |
| 4th row | 4 or more regional lymph nodes removed |
| 5th row | Unknown or not applicable |
Common Values
| Value | Count | Frequency (%) |
| 4 or more regional lymph nodes removed | 273 | 11.2% |
| 1 to 3 regional lymph nodes removed | 206 | 8.5% |
| Unknown or not applicable | 62 | 2.6% |
| Biopsy or aspiration of regional lymph node, NOS | 10 | 0.4% |
| Number of regional lymph nodes removed unknown | 3 | 0.1% |
| Sentinel lymph node biopsy | 1 | < 0.1% |
| (Missing) | 1875 | 77.2% |
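The percentages for this variable use two different denominators, which is easy to miss. The figures are consistent with Missing (%) and Frequency (%) being taken over all 2430 rows, while Distinct (%) and Unique (%) are taken over the 555 non-missing rows. The Python sketch below checks that reading against the numbers reported above; the helper `pct` is hypothetical, and the denominator rule is inferred from the figures rather than confirmed by the report.

```python
# Figures copied from the RX Summ--Scope Reg LN Sur (2003+) tables above.
n_rows = 2430      # number of observations in the dataset
n_missing = 1875   # missing cells for this variable
n_distinct = 6     # distinct non-missing values
n_unique = 1       # values that occur exactly once


def pct(part, whole):
    """Hypothetical helper: percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)


n_present = n_rows - n_missing               # rows with a recorded value
missing_pct = pct(n_missing, n_rows)         # share of all rows that are missing
distinct_pct = pct(n_distinct, n_present)    # assumed denominator: non-missing rows
unique_pct = pct(n_unique, n_present)        # assumed denominator: non-missing rows
top_value_pct = pct(273, n_rows)             # "4 or more regional lymph nodes removed"

print(n_present, missing_pct, distinct_pct, unique_pct, top_value_pct)
```

Under these assumptions the script reproduces the reported 77.2%, 1.1%, 0.2%, and 11.2%, and the six category counts (273 + 206 + 62 + 10 + 3 + 1) sum to the 555 non-missing rows.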
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| lymph | 493 | |
| regional | 492 | |
| nodes | 482 | |
| removed | 482 | |
| or | 345 | |
| 4 | 273 | |
| more | 273 | |
| 1 | 206 | |
| to | 206 | |
| 3 | 206 | |
| Other values (10) | 248 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 3151 | |
| o | 2452 | |
| e | 2289 | |
| r | 1605 | 8.1% |
| n | 1254 | 6.3% |
| m | 1251 | 6.3% |
| l | 1110 | 5.6% |
| d | 975 | 4.9% |
| p | 638 | 3.2% |
| a | 636 | 3.2% |
| Other values (22) | 4417 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 19778 |
RX Summ--Surg Oth Reg/Dis (2003+)
Categorical
Imbalance 
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
Length
| Max length | 60 |
|---|---|
| Median length | 26 |
| Mean length | 27.169136 |
| Min length | 26 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | None; diagnosed at autopsy |
|---|---|
| 2nd row | None; diagnosed at autopsy |
| 3rd row | None; diagnosed at autopsy |
| 4th row | None; diagnosed at autopsy |
| 5th row | None; diagnosed at autopsy |
Common Values
| Value | Count | Frequency (%) |
| None; diagnosed at autopsy | 2291 | 94.3% |
| Non-primary surgical procedure to other regional sites | 48 | 2.0% |
| Non-primary surgical procedure to distant site | 47 | 1.9% |
| Unknown; death certificate only | 21 | 0.9% |
| Non-primary surgical procedure performed | 16 | 0.7% |
| Any combo of sur proc to oth rg, dis lym nd, and/or dis site | 5 | 0.2% |
| Non-primary surgical procedure to distant lymph node(s) | 2 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| none | 2291 | |
| diagnosed | 2291 | |
| at | 2291 | |
| autopsy | 2291 | |
| non-primary | 113 | 1.1% |
| surgical | 113 | 1.1% |
| procedure | 113 | 1.1% |
| to | 102 | 1.0% |
| site | 52 | 0.5% |
| distant | 49 | 0.5% |
| Other values (21) | 308 | 3.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 7584 | |
| o | 7387 | |
| a | 7243 | |
| e | 5101 | 7.7% |
| t | 4998 | 7.6% |
| s | 4909 | 7.4% |
| n | 4893 | 7.4% |
| d | 4803 | 7.3% |
| i | 2766 | 4.2% |
| p | 2540 | 3.8% |
| Other values (21) | 13797 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 66021 |
RX Summ--Surg/Rad Seq
Categorical
High correlation  Imbalance 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
Length
| Max length | 73 |
|---|---|
| Median length | 73 |
| Mean length | 72.920165 |
| Min length | 23 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | No radiation and/or no surgery; unknown if surgery and/or radiation given |
|---|---|
| 2nd row | No radiation and/or no surgery; unknown if surgery and/or radiation given |
| 3rd row | No radiation and/or no surgery; unknown if surgery and/or radiation given |
| 4th row | No radiation and/or no surgery; unknown if surgery and/or radiation given |
| 5th row | No radiation and/or no surgery; unknown if surgery and/or radiation given |
Common Values
| Value | Count | Frequency (%) |
| No radiation and/or no surgery; unknown if surgery and/or radiation given | 2426 | 99.8% |
| Radiation after surgery | 2 | 0.1% |
| Radiation prior to surgery | 2 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| radiation | 4856 | |
| surgery | 4856 | |
| no | 4852 | |
| and/or | 4852 | |
| unknown | 2426 | |
| if | 2426 | |
| given | 2426 | |
| after | 2 | < 0.1% |
| prior | 2 | < 0.1% |
| to | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 24270 | |
| n | 21838 | |
| r | 19422 | |
| o | 16990 | |
| i | 14566 | |
| a | 14566 | |
| d | 9708 | 5.5% |
| e | 7284 | 4.1% |
| g | 7282 | 4.1% |
| u | 7282 | 4.1% |
| Other values (12) | 33988 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 177196 |
Reason no cancer-directed surgery
Categorical
High correlation  Imbalance 
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
Length
| Max length | 76 |
|---|---|
| Median length | 17 |
| Mean length | 18.627984 |
| Min length | 15 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Surgery performed |
|---|---|
| 2nd row | Surgery performed |
| 3rd row | Surgery performed |
| 4th row | Surgery performed |
| 5th row | Surgery performed |
Common Values
| Value | Count | Frequency (%) |
| Surgery performed | 1766 | 72.7% |
| Not recommended | 525 | 21.6% |
| Not recommended, contraindicated due to other cond; autopsy only (1973-2002) | 40 | 1.6% |
| Recommended but not performed, patient refused | 29 | 1.2% |
| Recommended, unknown if performed | 27 | 1.1% |
| Recommended but not performed, unknown reason | 19 | 0.8% |
| Unknown; death certificate; or autopsy only (2003+) | 19 | 0.8% |
| Not performed, patient died prior to recommended surgery | 5 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| performed | 1846 | |
| surgery | 1771 | |
| recommended | 645 | 11.6% |
| not | 618 | 11.1% |
| unknown | 65 | 1.2% |
| autopsy | 59 | 1.1% |
| only | 59 | 1.1% |
| but | 48 | 0.9% |
| to | 45 | 0.8% |
| other | 40 | 0.7% |
| Other values (14) | 355 | 6.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 7980 | |
| e | 7691 | |
| o | 3500 | |
| d | 3354 | 7.4% |
| m | 3136 | 6.9% |
| (space) | 3121 | 6.9% |
| u | 1993 | 4.4% |
| p | 1944 | 4.3% |
| f | 1921 | 4.2% |
| y | 1889 | 4.2% |
| Other values (28) | 8737 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 45266 |
Radiation recode
Categorical
Imbalance 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
Length
| Max length | 36 |
|---|---|
| Median length | 12 |
| Mean length | 12.049794 |
| Min length | 12 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | None/Unknown |
|---|---|
| 2nd row | None/Unknown |
| 3rd row | None/Unknown |
| 4th row | None/Unknown |
| 5th row | None/Unknown |
Common Values
| Value | Count | Frequency (%) |
| None/Unknown | 2414 | 99.3% |
| Beam radiation | 11 | 0.5% |
| Recommended, unknown if administered | 4 | 0.2% |
| Refused (1988+) | 1 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| none/unknown | 2414 | |
| beam | 11 | 0.4% |
| radiation | 11 | 0.4% |
| recommended | 4 | 0.2% |
| unknown | 4 | 0.2% |
| if | 4 | 0.2% |
| administered | 4 | 0.2% |
| refused | 1 | < 0.1% |
| 1988 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 9687 | |
| o | 4847 | |
| e | 2447 | 8.4% |
| w | 2418 | 8.3% |
| k | 2418 | 8.3% |
| N | 2414 | 8.2% |
| U | 2414 | 8.2% |
| / | 2414 | 8.2% |
| a | 37 | 0.1% |
| i | 34 | 0.1% |
| Other values (18) | 151 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 29281 |
Chemotherapy recode (yes, no/unk)
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 7.562963 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | No/Unknown |
|---|---|
| 2nd row | No/Unknown |
| 3rd row | No/Unknown |
| 4th row | No/Unknown |
| 5th row | No/Unknown |
Common Values
| Value | Count | Frequency (%) |
| No/Unknown | 1584 | 65.2% |
| Yes | 846 | 34.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| no/unknown | 1584 | |
| yes | 846 |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 4752 | |
| o | 3168 | |
| N | 1584 | 8.6% |
| / | 1584 | 8.6% |
| U | 1584 | 8.6% |
| k | 1584 | 8.6% |
| w | 1584 | 8.6% |
| Y | 846 | 4.6% |
| e | 846 | 4.6% |
| s | 846 | 4.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 18378 |
RX Summ--Systemic/Sur Seq (2007+)
Categorical
High correlation  Imbalance 
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
Length
| Max length | 46 |
|---|---|
| Median length | 46 |
| Mean length | 42.950206 |
| Min length | 16 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | No systemic therapy and/or surgical procedures |
|---|---|
| 2nd row | No systemic therapy and/or surgical procedures |
| 3rd row | No systemic therapy and/or surgical procedures |
| 4th row | No systemic therapy and/or surgical procedures |
| 5th row | No systemic therapy and/or surgical procedures |
Common Values
| Value | Count | Frequency (%) |
| No systemic therapy and/or surgical procedures | 1851 | 76.2% |
| Systemic therapy after surgery | 331 | 13.6% |
| Systemic therapy before surgery | 138 | 5.7% |
| Systemic therapy both before and after surgery | 104 | 4.3% |
| Surgery both before and after systemic therapy | 4 | 0.2% |
| Sequence unknown | 1 | < 0.1% |
| Intraoperative systemic therapy | 1 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| systemic | 2429 | |
| therapy | 2429 | |
| no | 1851 | |
| and/or | 1851 | |
| surgical | 1851 | |
| procedures | 1851 | |
| surgery | 577 | 4.2% |
| after | 439 | 3.2% |
| before | 246 | 1.8% |
| both | 108 | 0.8% |
| Other values (4) | 111 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 11674 | |
| (space) | 11313 | 10.8% |
| e | 10073 | 9.7% |
| s | 8560 | 8.2% |
| a | 6680 | 6.4% |
| c | 6132 | 5.9% |
| o | 5909 | 5.7% |
| y | 5435 | 5.2% |
| t | 5407 | 5.2% |
| i | 4281 | 4.1% |
| Other values (18) | 28905 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 104369 |
| Distinct | 201 |
|---|---|
| Distinct (%) | 8.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
Length
| Max length | 19 |
|---|---|
| Median length | 3 |
| Mean length | 5.563786 |
| Min length | 3 |
Unique
| Unique | 65 |
|---|---|
| Unique (%) | 2.7% |
Sample
| 1st row | 045 |
|---|---|
| 2nd row | 000 |
| 3rd row | 000 |
| 4th row | 025 |
| 5th row | 002 |
| Value | Count | Frequency (%) |
| 000 | 748 | 23.3% |
| unable | 389 | 12.1% |
| to | 389 | 12.1% |
| calculate | 389 | 12.1% |
| 028 | 25 | 0.8% |
| 021 | 24 | 0.7% |
| 014 | 24 | 0.7% |
| 023 | 23 | 0.7% |
| 042 | 23 | 0.7% |
| 007 | 22 | 0.7% |
| Other values (194) | 1153 | 35.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3649 | 27.0% |
| a | 1168 | 8.6% |
| l | 1167 | 8.6% |
| (space) | 779 | 5.8% |
| e | 778 | 5.8% |
| t | 778 | 5.8% |
| c | 778 | 5.8% |
| 1 | 435 | 3.2% |
| b | 389 | 2.9% |
| n | 389 | 2.9% |
| Other values (15) | 3210 | 23.7% |
EOD Primary Tumor Recode (2018+)
Categorical
High correlation  Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 100 |
|---|---|
| 2nd row | 100 |
| 3rd row | 100 |
| 4th row | 100 |
| 5th row | 100 |
Common Values
| Value | Count | Frequency (%) |
| 100 | 1897 | 78.1% |
| 999 | 219 | 9.0% |
| 700 | 168 | 6.9% |
| 400 | 139 | 5.7% |
| 800 | 7 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 4422 | 60.7% |
| 1 | 1897 | 26.0% |
| 9 | 657 | 9.0% |
| 7 | 168 | 2.3% |
| 4 | 139 | 1.9% |
| 8 | 7 | 0.1% |
EOD Regional Nodes Recode (2018+)
Categorical
High correlation  Imbalance 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 000 |
|---|---|
| 2nd row | 000 |
| 3rd row | 000 |
| 4th row | 000 |
| 5th row | 000 |
Common Values
| Value | Count | Frequency (%) |
| 000 | 2148 | 88.4% |
| 999 | 210 | 8.6% |
| 300 | 55 | 2.3% |
| 800 | 17 | 0.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 6588 | 90.4% |
| 9 | 630 | 8.6% |
| 3 | 55 | 0.8% |
| 8 | 17 | 0.2% |
EOD Mets Recode (2018+)
Categorical
High correlation  Imbalance 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 00 |
|---|---|
| 2nd row | 00 |
| 3rd row | 00 |
| 4th row | 00 |
| 5th row | 00 |
Common Values
| Value | Count | Frequency (%) |
| 00 | 2141 | 88.1% |
| 70 | 287 | 11.8% |
| 10 | 2 | 0.1% |
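The Frequency (%) column and the Imbalance flag can both be related back to the raw counts in this table. The sketch below recomputes the percentages from the three EOD Mets Recode counts and derives one possible imbalance score. Normalized Shannon entropy is used here as an assumption for illustration and is not necessarily the formula the profiling tool applies; the names `counts`, `frequencies`, and `imbalance` are hypothetical.

```python
import math

# Counts copied from the Common Values table for EOD Mets Recode (2018+).
counts = {"00": 2141, "70": 287, "10": 2}
total = sum(counts.values())  # number of observations in the dataset

# Frequency (%) is each count divided by the number of observations.
frequencies = {value: round(100 * n / total, 1) for value, n in counts.items()}

# Assumed imbalance measure: 1 - H / log(k), where H is the Shannon entropy
# of the class proportions and k is the number of classes.
# 0 means perfectly balanced; values near 1 mean one class dominates.
entropy = -sum((n / total) * math.log(n / total) for n in counts.values())
imbalance = 1 - entropy / math.log(len(counts))

print(frequencies)
print(round(imbalance, 3))
```

A score of this kind only summarizes how skewed the class proportions are; whether the skew matters depends on how the variable is used downstream.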
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 4571 | 94.1% |
| 7 | 287 | 5.9% |
| 1 | 2 | < 0.1% |
| Distinct | 192 |
|---|---|
| Distinct (%) | 7.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
Length
| Max length | 67 |
|---|---|
| Median length | 3 |
| Mean length | 10.3 |
| Min length | 3 |
Unique
| Unique | 44 |
|---|---|
| Unique (%) | 1.8% |
Sample
| 1st row | 028 |
|---|---|
| 2nd row | 023 |
| 3rd row | 035 |
| 4th row | 022 |
| 5th row | 023 |
| Value | Count | Frequency (%) |
| tumor | 295 | 6.1% |
| size | 294 | 6.1% |
| or | 294 | 6.1% |
| unreasonable | 282 | 5.8% |
| unknown | 282 | 5.8% |
| includes | 282 | 5.8% |
| any | 282 | 5.8% |
| sizes | 282 | 5.8% |
| 401-989 | 282 | 5.8% |
| 035 | 73 | 1.5% |
| Other values (203) | 2177 | 45.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2923 | 11.7% |
| (space) | 2395 | 9.6% |
| n | 2000 | 8.0% |
| e | 1485 | 5.9% |
| s | 1448 | 5.8% |
| o | 1206 | 4.8% |
| 1 | 979 | 3.9% |
| i | 922 | 3.7% |
| r | 898 | 3.6% |
| u | 860 | 3.4% |
| Other values (33) | 9913 | 39.6% |
Tumor Size Summary (2016+)
Real number (ℝ)
High correlation 
| Distinct | 216 |
|---|---|
| Distinct (%) | 8.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 152.69465 |
| Minimum | 0 |
|---|---|
| Maximum | 999 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 38.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 24 |
| median | 45 |
| Q3 | 98 |
| 95-th percentile | 999 |
| Maximum | 999 |
| Range | 999 |
| Interquartile range (IQR) | 74 |
Descriptive statistics
| Standard deviation | 289.18612 |
|---|---|
| Coefficient of variation (CV) | 1.8938851 |
| Kurtosis | 4.4939333 |
| Mean | 152.69465 |
| Median Absolute Deviation (MAD) | 28 |
| Skewness | 2.4975768 |
| Sum | 371048 |
| Variance | 83628.613 |
| Monotonicity | Not monotonic |
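Several of the figures reported for Tumor Size Summary are derived from one another, so they can be cross-checked with simple arithmetic. The sketch below is a minimal consistency check using only numbers copied from the tables above; the variable names and the `checks` dictionary are hypothetical, and the tolerance is an arbitrary choice that allows for rounding in the report.

```python
import math

# Figures copied from the Tumor Size Summary (2016+) tables.
n = 2430              # number of observations
total = 371048        # Sum
mean = 152.69465
std = 289.18612
variance = 83628.613
cv = 1.8938851        # Coefficient of variation
q1, q3 = 24, 98
minimum, maximum = 0, 999

# Each entry pairs a relationship with whether the reported values satisfy it.
checks = {
    "mean = sum / n": math.isclose(total / n, mean, rel_tol=1e-6),
    "variance = std ** 2": math.isclose(std ** 2, variance, rel_tol=1e-6),
    "CV = std / mean": math.isclose(std / mean, cv, rel_tol=1e-6),
    "IQR = Q3 - Q1": q3 - q1 == 74,
    "range = max - min": maximum - minimum == 999,
}

for name, ok in checks.items():
    print(f"{name}: {'consistent' if ok else 'MISMATCH'}")
```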
Common values
| Value | Count | Frequency (%) |
| 999 | 244 | 10.0% |
| 35 | 73 | 3.0% |
| 40 | 68 | 2.8% |
| 25 | 61 | 2.5% |
| 45 | 54 | 2.2% |
| 30 | 52 | 2.1% |
| 20 | 47 | 1.9% |
| 50 | 47 | 1.9% |
| 15 | 45 | 1.9% |
| 5 | 43 | 1.8% |
| Other values (206) | 1696 | 69.8% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 1 | 10 | 0.4% |
| 2 | 12 | 0.5% |
| 3 | 25 | 1.0% |
| 4 | 39 | 1.6% |
| 5 | 43 | 1.8% |
| 6 | 35 | 1.4% |
| 7 | 27 | 1.1% |
| 8 | 35 | 1.4% |
| 9 | 12 | 0.5% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 999 | 244 | 10.0% |
| 990 | 1 | < 0.1% |
| 989 | 2 | 0.1% |
| 380 | 1 | < 0.1% |
| 350 | 1 | < 0.1% |
| 333 | 1 | < 0.1% |
| 330 | 1 | < 0.1% |
| 320 | 1 | < 0.1% |
| 310 | 1 | < 0.1% |
| 307 | 1 | < 0.1% |
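The value 999 accounts for 244 of the 2430 records and sits far above the next measured sizes, which suggests it is a sentinel code and not a measurement. Assuming that 999 marks an unknown size (an assumption that should be checked against the SEER data dictionary before relying on it), the sketch below shows one way to set such codes aside before summarizing. The sample values and the names `SENTINEL_CODES` and `summarize` are hypothetical and are not taken from the dataset.

```python
import statistics

# Assumed "unknown size" code; verify against the data dictionary before use.
SENTINEL_CODES = {999}

def summarize(sizes, sentinels=SENTINEL_CODES):
    """Return counts, mean and median of sizes after dropping sentinel codes."""
    measured = [s for s in sizes if s not in sentinels]
    return {
        "n_measured": len(measured),
        "n_sentinel": len(sizes) - len(measured),
        "mean": statistics.mean(measured),
        "median": statistics.median(measured),
    }

# Hypothetical sample for illustration only.
sample = [35, 40, 25, 999, 45, 30, 999, 20]
result = summarize(sample)
print(result)
```

Keeping the set of sentinel codes as a parameter makes it easy to extend if other values in the column also turn out to be codes.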
Regional nodes examined (1988+)
Categorical
High correlation  Imbalance 
| Distinct | 44 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 9 |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | 00 |
|---|---|
| 2nd row | 07 |
| 3rd row | 02 |
| 4th row | 00 |
| 5th row | 00 |
Common Values
| Value | Count | Frequency (%) |
| 00 | 1874 | 77.1% |
| 01 | 100 | 4.1% |
| 02 | 61 | 2.5% |
| 99 | 55 | 2.3% |
| 03 | 51 | 2.1% |
| 05 | 30 | 1.2% |
| 04 | 26 | 1.1% |
| 06 | 25 | 1.0% |
| 07 | 16 | 0.7% |
| 12 | 14 | 0.6% |
| Other values (34) | 178 | 7.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 4098 | 84.3% |
| 1 | 223 | 4.6% |
| 9 | 148 | 3.0% |
| 2 | 119 | 2.4% |
| 3 | 75 | 1.5% |
| 5 | 60 | 1.2% |
| 4 | 47 | 1.0% |
| 6 | 38 | 0.8% |
| 7 | 26 | 0.5% |
| 8 | 26 | 0.5% |
Regional nodes positive (1988+)
Categorical
High correlation  Imbalance 
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 98 |
|---|---|
| 2nd row | 00 |
| 3rd row | 00 |
| 4th row | 98 |
| 5th row | 98 |
Common Values
| Value | Count | Frequency (%) |
| 98 | 1874 | 77.1% |
| 00 | 472 | 19.4% |
| 99 | 59 | 2.4% |
| 01 | 16 | 0.7% |
| 02 | 4 | 0.2% |
| 95 | 3 | 0.1% |
| 03 | 1 | < 0.1% |
| 04 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 9 | 1995 | 41.0% |
| 8 | 1874 | 38.6% |
| 0 | 966 | 19.9% |
| 1 | 16 | 0.3% |
| 2 | 4 | 0.1% |
| 5 | 3 | 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
SEER Combined Mets at DX-bone (2010+)
Categorical
High correlation  Imbalance 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
Length
| Max length | 7 |
|---|---|
| Median length | 2 |
| Mean length | 2.0646091 |
| Min length | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | No |
|---|---|
| 2nd row | No |
| 3rd row | No |
| 4th row | No |
| 5th row | No |
Common Values
| Value | Count | Frequency (%) |
| No | 2393 | 98.5% |
| Unknown | 30 | 1.2% |
| Yes | 7 | 0.3% |
Words
| Value | Count | Frequency (%) |
| no | 2393 | 98.5% |
| unknown | 30 | 1.2% |
| yes | 7 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 2423 | 48.3% |
| N | 2393 | 47.7% |
| n | 90 | 1.8% |
| U | 30 | 0.6% |
| k | 30 | 0.6% |
| w | 30 | 0.6% |
| Y | 7 | 0.1% |
| e | 7 | 0.1% |
| s | 7 | 0.1% |
SEER Combined Mets at DX-brain (2010+)
Categorical
High correlation  Imbalance 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
Length
| Max length | 7 |
|---|---|
| Median length | 2 |
| Mean length | 2.0666667 |
| Min length | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | No |
|---|---|
| 2nd row | No |
| 3rd row | No |
| 4th row | No |
| 5th row | No |
Common Values
| Value | Count | Frequency (%) |
| No | 2396 | 98.6% |
| Unknown | 32 | 1.3% |
| Yes | 2 | 0.1% |
Words
| Value | Count | Frequency (%) |
| no | 2396 | 98.6% |
| unknown | 32 | 1.3% |
| yes | 2 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 2428 | 48.3% |
| N | 2396 | 47.7% |
| n | 96 | 1.9% |
| U | 32 | 0.6% |
| k | 32 | 0.6% |
| w | 32 | 0.6% |
| Y | 2 | < 0.1% |
| e | 2 | < 0.1% |
| s | 2 | < 0.1% |
SEER Combined Mets at DX-liver (2010+)
Categorical
High correlation  Imbalance 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
Length
| Max length | 7 |
|---|---|
| Median length | 2 |
| Mean length | 2.1271605 |
| Min length | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | No |
|---|---|
| 2nd row | No |
| 3rd row | No |
| 4th row | No |
| 5th row | No |
Common Values
| Value | Count | Frequency (%) |
| No | 2229 | 91.7% |
| Yes | 174 | 7.2% |
| Unknown | 27 | 1.1% |
Words
| Value | Count | Frequency (%) |
| no | 2229 | 91.7% |
| yes | 174 | 7.2% |
| unknown | 27 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 2256 | 43.6% |
| N | 2229 | 43.1% |
| Y | 174 | 3.4% |
| e | 174 | 3.4% |
| s | 174 | 3.4% |
| n | 81 | 1.6% |
| U | 27 | 0.5% |
| k | 27 | 0.5% |
| w | 27 | 0.5% |
SEER Combined Mets at DX-lung (2010+)
Categorical
High correlation  Imbalance 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
Length
| Max length | 7 |
|---|---|
| Median length | 2 |
| Mean length | 2.0646091 |
| Min length | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | No |
|---|---|
| 2nd row | No |
| 3rd row | No |
| 4th row | No |
| 5th row | No |
Common Values
| Value | Count | Frequency (%) |
| No | 2393 | 98.5% |
| Unknown | 30 | 1.2% |
| Yes | 7 | 0.3% |
Words
| Value | Count | Frequency (%) |
| no | 2393 | 98.5% |
| unknown | 30 | 1.2% |
| yes | 7 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 2423 | 48.3% |
| N | 2393 | 47.7% |
| n | 90 | 1.8% |
| U | 30 | 0.6% |
| k | 30 | 0.6% |
| w | 30 | 0.6% |
| Y | 7 | 0.1% |
| e | 7 | 0.1% |
| s | 7 | 0.1% |
Mets at DX-Distant LN (2016+)
Categorical
High correlation  Imbalance 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
Length
| Max length | 34 |
|---|---|
| Median length | 30 |
| Mean length | 29.741975 |
| Min length | 7 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | None; no lymph node metastases |
|---|---|
| 2nd row | None; no lymph node metastases |
| 3rd row | None; no lymph node metastases |
| 4th row | None; no lymph node metastases |
| 5th row | None; no lymph node metastases |
Common Values
| Value | Count | Frequency (%) |
| None; no lymph node metastases | 2391 | 98.4% |
| Unknown | 29 | 1.2% |
| Yes; distant lymph node metastases | 10 | 0.4% |
Words
| Value | Count | Frequency (%) |
| node | 2401 | 20.0% |
| lymph | 2401 | 20.0% |
| metastases | 2401 | 20.0% |
| no | 2391 | 19.9% |
| none | 2391 | 19.9% |
| unknown | 29 | 0.2% |
| yes | 10 | 0.1% |
| distant | 10 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 9604 | 13.3% |
| (space) | 9604 | 13.3% |
| n | 7280 | 10.1% |
| s | 7223 | 10.0% |
| o | 7212 | 10.0% |
| t | 4822 | 6.7% |
| a | 4812 | 6.7% |
| m | 4802 | 6.6% |
| d | 2411 | 3.3% |
| h | 2401 | 3.3% |
| Other values (10) | 12102 | 16.7% |
Mets at DX-Other (2016+)
Categorical
High correlation  Imbalance 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
Length
| Max length | 79 |
|---|---|
| Median length | 25 |
| Mean length | 27.481481 |
| Min length | 7 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | None; no other metastases |
|---|---|
| 2nd row | None; no other metastases |
| 3rd row | None; no other metastases |
| 4th row | None; no other metastases |
| 5th row | None; no other metastases |
Common Values
| Value | Count | Frequency (%) |
| None; no other metastases | 2268 | 93.3% |
| Yes; distant mets in known site(s) other than bone, brain, liver, lung, dist LN | 115 | 4.7% |
| Unknown | 29 | 1.2% |
| generalized metastases such as carinomatosis | 18 | 0.7% |
Words
| Value | Count | Frequency (%) |
| other | 2383 | 22.1% |
| metastases | 2286 | 21.2% |
| no | 2268 | 21.0% |
| none | 2268 | 21.0% |
| yes | 115 | 1.1% |
| distant | 115 | 1.1% |
| mets | 115 | 1.1% |
| in | 115 | 1.1% |
| known | 115 | 1.1% |
| site(s | 115 | 1.1% |
| Other values (12) | 906 | 8.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 9852 | 14.8% |
| (space) | 8371 | 12.5% |
| t | 7663 | 11.5% |
| s | 7620 | 11.4% |
| o | 7214 | 10.8% |
| n | 5579 | 8.4% |
| a | 4989 | 7.5% |
| r | 2649 | 4.0% |
| h | 2516 | 3.8% |
| m | 2419 | 3.6% |
| Other values (19) | 7908 | 11.8% |
COD to site recode
Categorical
High correlation  Imbalance 
| Distinct | 38 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
Length
| Max length | 55 |
|---|---|
| Median length | 5 |
| Mean length | 7.3222222 |
| Min length | 5 |
Unique
| Unique | 13 |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | Alive |
|---|---|
| 2nd row | Alive |
| 3rd row | Pancreas |
| 4th row | Alive |
| 5th row | Alive |
Common Values
| Value | Count | Frequency (%) |
| Alive | 2163 | 89.0% |
| In situ, benign or unknown behavior neoplasm | 57 | 2.3% |
| Other Cause of Death | 38 | 1.6% |
| Soft Tissue including Heart | 33 | 1.4% |
| Diseases of Heart | 25 | 1.0% |
| Stomach | 17 | 0.7% |
| Miscellaneous Malignant Cancer | 11 | 0.5% |
| Esophagus | 8 | 0.3% |
| Cerebrovascular Diseases | 7 | 0.3% |
| State DC not available or state DC available but no COD | 7 | 0.3% |
| Other values (28) | 64 | 2.6% |
Words
| Value | Count | Frequency (%) |
| alive | 2163 | 66.5% |
| or | 64 | 2.0% |
| of | 63 | 1.9% |
| heart | 60 | 1.8% |
| situ | 57 | 1.8% |
| unknown | 57 | 1.8% |
| in | 57 | 1.8% |
| behavior | 57 | 1.8% |
| neoplasm | 57 | 1.8% |
| benign | 57 | 1.8% |
| Other values (81) | 562 | 17.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2808 | 15.8% |
| i | 2590 | 14.6% |
| l | 2371 | 13.3% |
| v | 2261 | 12.7% |
| A | 2176 | 12.2% |
| (space) | 824 | 4.6% |
| n | 640 | 3.6% |
| a | 503 | 2.8% |
| o | 442 | 2.5% |
| s | 436 | 2.5% |
| Other values (40) | 2742 | 15.4% |
SEER cause-specific death classification
Categorical
High correlation  Imbalance 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
Length
| Max length | 37 |
|---|---|
| Median length | 28 |
| Mean length | 28.460905 |
| Min length | 26 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Alive or dead of other cause |
|---|---|
| 2nd row | Alive or dead of other cause |
| 3rd row | Alive or dead of other cause |
| 4th row | Alive or dead of other cause |
| 5th row | Alive or dead of other cause |
Common Values
| Value | Count | Frequency (%) |
| Alive or dead of other cause | 2297 | 94.5% |
| Dead (attributable to this cancer dx) | 126 | 5.2% |
| Dead (missing/unknown COD) | 7 | 0.3% |
Words
| Value | Count | Frequency (%) |
| dead | 2430 | 16.7% |
| alive | 2297 | 15.8% |
| or | 2297 | 15.8% |
| of | 2297 | 15.8% |
| other | 2297 | 15.8% |
| cause | 2297 | 15.8% |
| attributable | 126 | 0.9% |
| to | 126 | 0.9% |
| this | 126 | 0.9% |
| cancer | 126 | 0.9% |
| Other values (3) | 140 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 12129 | 17.5% |
| e | 9573 | 13.8% |
| o | 7024 | 10.2% |
| a | 5105 | 7.4% |
| d | 4853 | 7.0% |
| r | 4846 | 7.0% |
| t | 2927 | 4.2% |
| i | 2563 | 3.7% |
| c | 2549 | 3.7% |
| s | 2437 | 3.5% |
| Other values (19) | 15154 | 21.9% |
SEER other cause of death classification
Categorical
High correlation  Imbalance 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
| Value | Count |
|---|---|
| Alive or dead due to cancer | 2289 |
| Dead (attributable to causes other than this cancer dx) | 134 |
| Dead (missing/unknown COD) | 7 |
Length
| Max length | 55 |
|---|---|
| Median length | 27 |
| Mean length | 28.541152 |
| Min length | 26 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Alive or dead due to cancer |
|---|---|
| 2nd row | Alive or dead due to cancer |
| 3rd row | Dead (attributable to causes other than this cancer dx) |
| 4th row | Alive or dead due to cancer |
| 5th row | Alive or dead due to cancer |
Common Values
| Value | Count | Frequency (%) |
| Alive or dead due to cancer | 2289 | 94.2% |
| Dead (attributable to causes other than this cancer dx) | 134 | 5.5% |
| Dead (missing/unknown COD) | 7 | 0.3% |
Words
| Value | Count | Frequency (%) |
| dead | 2430 | 16.2% |
| to | 2423 | 16.2% |
| cancer | 2423 | 16.2% |
| alive | 2289 | 15.3% |
| or | 2289 | 15.3% |
| due | 2289 | 15.3% |
| attributable | 134 | 0.9% |
| causes | 134 | 0.9% |
| other | 134 | 0.9% |
| than | 134 | 0.9% |
| Other values (4) | 282 | 1.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 12531 | 18.1% |
| e | 9833 | 14.2% |
| d | 7142 | 10.3% |
| a | 5389 | 7.8% |
| r | 4980 | 7.2% |
| c | 4980 | 7.2% |
| o | 4853 | 7.0% |
| t | 3227 | 4.7% |
| n | 2585 | 3.7% |
| i | 2571 | 3.7% |
| Other values (18) | 11264 | 16.2% |
Survival months
Text
| Distinct | 61 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
Length
| Max length | 7 |
|---|---|
| Median length | 4 |
| Mean length | 4.0049383 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0007 |
|---|---|
| 2nd row | 0025 |
| 3rd row | 0027 |
| 4th row | 0043 |
| 5th row | 0012 |
| Value | Count | Frequency (%) |
| 0000 | 126 | 5.2% |
| 0001 | 89 | 3.7% |
| 0003 | 85 | 3.5% |
| 0004 | 80 | 3.3% |
| 0006 | 79 | 3.3% |
| 0002 | 77 | 3.2% |
| 0005 | 76 | 3.1% |
| 0009 | 76 | 3.1% |
| 0010 | 74 | 3.0% |
| 0008 | 73 | 3.0% |
| Other values (51) | 1595 | 65.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 6014 | 61.8% |
| 1 | 920 | 9.5% |
| 2 | 593 | 6.1% |
| 3 | 467 | 4.8% |
| 4 | 449 | 4.6% |
| 5 | 389 | 4.0% |
| 6 | 238 | 2.4% |
| 9 | 238 | 2.4% |
| 8 | 199 | 2.0% |
| 7 | 197 | 2.0% |
| Other values (5) | 28 | 0.3% |
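The profile types Survival months as Text rather than numeric: the sample values are zero-padded strings such as 0007, and a maximum length of 7 against a minimum of 4 suggests that a few rows hold a non-numeric placeholder. The following is a minimal sketch of one way to convert such values to integers. It assumes, without confirmation from the report, that anything that is not purely digits should be treated as missing; the function name `parse_survival_months` and the `"Unknown"` placeholder are hypothetical.

```python
from typing import List, Optional


def parse_survival_months(raw: str) -> Optional[int]:
    """Convert a zero-padded month string such as "0007" to an int.

    Returns None for anything that is not purely decimal digits
    (assumed here to stand for an unknown survival time).
    """
    text = raw.strip()
    if not text.isdecimal():
        return None
    return int(text)


# Illustrative values: the first five mirror the report's sample rows;
# the last one is a hypothetical non-numeric placeholder.
samples: List[str] = ["0007", "0025", "0027", "0043", "0012", "Unknown"]
parsed = [parse_survival_months(s) for s in samples]
print(parsed)  # [7, 25, 27, 43, 12, None]
```

Returning `None` rather than a sentinel such as 0 keeps genuinely zero-month survival (the most common value in the table above) distinct from unknown survival.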
Survival months flag
Categorical
High correlation  Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
| Value | Count |
|---|---|
| Complete dates are available and there are more than 0 days of survival | 2379 |
| Complete dates are available and there are 0 days of survival | 37 |
| Incomplete dates are available and there cannot be zero days of follow-up | 9 |
| Not calculated because a Death Certificate Only or Autopsy Only case | 4 |
| Incomplete dates are available and there could be zero days of follow-up | 1 |
Length
| Max length | 73 |
|---|---|
| Median length | 71 |
| Mean length | 70.850617 |
| Min length | 61 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Complete dates are available and there are more than 0 days of survival |
|---|---|
| 2nd row | Complete dates are available and there are more than 0 days of survival |
| 3rd row | Complete dates are available and there are more than 0 days of survival |
| 4th row | Complete dates are available and there are more than 0 days of survival |
| 5th row | Complete dates are available and there are more than 0 days of survival |
Common Values
| Value | Count | Frequency (%) |
| Complete dates are available and there are more than 0 days of survival | 2379 | 97.9% |
| Complete dates are available and there are 0 days of survival | 37 | 1.5% |
| Incomplete dates are available and there cannot be zero days of follow-up | 9 | 0.4% |
| Not calculated because a Death Certificate Only or Autopsy Only case | 4 | 0.2% |
| Incomplete dates are available and there could be zero days of follow-up | 1 | < 0.1% |
Words
| Value | Count | Frequency (%) |
| are | 4842 | 15.4% |
| dates | 2426 | 7.7% |
| available | 2426 | 7.7% |
| and | 2426 | 7.7% |
| of | 2426 | 7.7% |
| there | 2426 | 7.7% |
| days | 2426 | 7.7% |
| complete | 2416 | 7.7% |
| 0 | 2416 | 7.7% |
| survival | 2416 | 7.7% |
| Other values (18) | 4852 | 15.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 29068 | 16.9% |
| a | 24230 | 14.1% |
| e | 21825 | 12.7% |
| r | 12081 | 7.0% |
| l | 9731 | 5.7% |
| t | 9690 | 5.6% |
| d | 7283 | 4.2% |
| o | 7283 | 4.2% |
| s | 7280 | 4.2% |
| v | 7258 | 4.2% |
| Other values (20) | 36438 | 21.2% |
COD to site rec KM
Categorical
High correlation  Imbalance 
| Distinct | 38 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
| Value | Count |
|---|---|
| Alive | 2163 |
| In situ, benign or unknown behavior neoplasm | 57 |
| Other Cause of Death | 38 |
| Soft Tissue including Heart | 33 |
| Diseases of Heart | 25 |
| Other values (33) | 114 |
Length
| Max length | 55 |
|---|---|
| Median length | 5 |
| Mean length | 7.3222222 |
| Min length | 5 |
Unique
| Unique | 13 |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | Alive |
|---|---|
| 2nd row | Alive |
| 3rd row | Pancreas |
| 4th row | Alive |
| 5th row | Alive |
Common Values
| Value | Count | Frequency (%) |
| Alive | 2163 | 89.0% |
| In situ, benign or unknown behavior neoplasm | 57 | 2.3% |
| Other Cause of Death | 38 | 1.6% |
| Soft Tissue including Heart | 33 | 1.4% |
| Diseases of Heart | 25 | 1.0% |
| Stomach | 17 | 0.7% |
| Miscellaneous Malignant Cancer | 11 | 0.5% |
| Esophagus | 8 | 0.3% |
| Cerebrovascular Diseases | 7 | 0.3% |
| State DC not available or state DC available but no COD | 7 | 0.3% |
| Other values (28) | 64 | 2.6% |
Words
| Value | Count | Frequency (%) |
| alive | 2163 | 66.5% |
| or | 64 | 2.0% |
| of | 63 | 1.9% |
| heart | 60 | 1.8% |
| situ | 57 | 1.8% |
| unknown | 57 | 1.8% |
| in | 57 | 1.8% |
| behavior | 57 | 1.8% |
| neoplasm | 57 | 1.8% |
| benign | 57 | 1.8% |
| Other values (81) | 562 | 17.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2808 | 15.8% |
| i | 2590 | 14.6% |
| l | 2371 | 13.3% |
| v | 2261 | 12.7% |
| A | 2176 | 12.2% |
| (space) | 824 | 4.6% |
| n | 640 | 3.6% |
| a | 503 | 2.8% |
| o | 442 | 2.5% |
| s | 436 | 2.5% |
| Other values (40) | 2742 | 15.4% |
COD to site recode ICD-O-3 2023 Revision
Categorical
High correlation  Imbalance 
| Distinct | 41 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
| Value | Count |
|---|---|
| Alive | 2163 |
| Benign and Borderline: All Other sites | 56 |
| Soft Tissue | 33 |
| Other COD | 30 |
| Stomach | 17 |
| Other values (36) | 131 |
Length
| Max length | 78 |
|---|---|
| Median length | 5 |
| Mean length | 7.1547325 |
| Min length | 5 |
Unique
| Unique | 13 |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | Alive |
|---|---|
| 2nd row | Alive |
| 3rd row | Pancreas |
| 4th row | Alive |
| 5th row | Alive |
Common Values
| Value | Count | Frequency (%) |
| Alive | 2163 | 89.0% |
| Benign and Borderline: All Other sites | 56 | 2.3% |
| Soft Tissue | 33 | 1.4% |
| Other COD | 30 | 1.2% |
| Stomach | 17 | 0.7% |
| Other and unspecified disorders of the circulatory system | 16 | 0.7% |
| Miscellaneous Neoplasms | 11 | 0.5% |
| Ischemic heart disease | 8 | 0.3% |
| Esophagus | 8 | 0.3% |
| Cerebrovascular diseases | 7 | 0.3% |
| Other values (31) | 81 | 3.3% |
Words
| Value | Count | Frequency (%) |
| alive | 2163 | 68.7% |
| other | 108 | 3.4% |
| and | 103 | 3.3% |
| borderline | 57 | 1.8% |
| benign | 57 | 1.8% |
| all | 56 | 1.8% |
| sites | 56 | 1.8% |
| cod | 37 | 1.2% |
| soft | 33 | 1.0% |
| tissue | 33 | 1.0% |
| Other values (91) | 446 | 14.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2868 | 16.5% |
| i | 2586 | 14.9% |
| l | 2479 | 14.3% |
| A | 2247 | 12.9% |
| v | 2206 | 12.7% |
| (space) | 719 | 4.1% |
| s | 463 | 2.7% |
| n | 424 | 2.4% |
| r | 396 | 2.3% |
| t | 369 | 2.1% |
| Other values (46) | 2629 | 15.1% |
COD to site recode ICD-O-3 2023 Revision Expanded (1999+)
Categorical
High correlation  Imbalance 
| Distinct | 42 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
| Value | Count |
|---|---|
| Alive | 2163 |
| Benign and Borderline: All Other sites | 56 |
| Soft Tissue | 33 |
| Other COD | 30 |
| Stomach | 17 |
| Other values (37) | 131 |
Length
| Max length | 78 |
|---|---|
| Median length | 5 |
| Mean length | 7.1695473 |
| Min length | 5 |
Unique
| Unique | 13 |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | Alive |
|---|---|
| 2nd row | Alive |
| 3rd row | Pancreas |
| 4th row | Alive |
| 5th row | Alive |
Common Values
| Value | Count | Frequency (%) |
| Alive | 2163 | 89.0% |
| Benign and Borderline: All Other sites | 56 | 2.3% |
| Soft Tissue | 33 | 1.4% |
| Other COD | 30 | 1.2% |
| Stomach | 17 | 0.7% |
| Other and unspecified disorders of the circulatory system | 16 | 0.7% |
| Esophagus | 8 | 0.3% |
| Ischemic heart disease | 8 | 0.3% |
| Miscellaneous Neoplasms | 8 | 0.3% |
| Cerebrovascular diseases | 7 | 0.3% |
| Other values (32) | 84 | 3.5% |
Words
| Value | Count | Frequency (%) |
| alive | 2163 | 68.6% |
| other | 109 | 3.5% |
| and | 102 | 3.2% |
| benign | 57 | 1.8% |
| borderline | 57 | 1.8% |
| all | 56 | 1.8% |
| sites | 56 | 1.8% |
| cod | 37 | 1.2% |
| soft | 33 | 1.0% |
| tissue | 33 | 1.0% |
| Other values (93) | 450 | 14.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2874 | 16.5% |
| i | 2589 | 14.9% |
| l | 2482 | 14.2% |
| A | 2247 | 12.9% |
| v | 2205 | 12.7% |
| (space) | 723 | 4.1% |
| s | 460 | 2.6% |
| n | 420 | 2.4% |
| r | 396 | 2.3% |
| t | 375 | 2.2% |
| Other values (47) | 2651 | 15.2% |
Vital status recode (study cutoff used)
Categorical
High correlation  Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
| Value | Count |
|---|---|
| Alive | 2163 |
| Dead | 267 |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.8901235 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Alive |
|---|---|
| 2nd row | Alive |
| 3rd row | Dead |
| 4th row | Alive |
| 5th row | Alive |
Common Values
| Value | Count | Frequency (%) |
| Alive | 2163 | 89.0% |
| Dead | 267 | 11.0% |
Words
| Value | Count | Frequency (%) |
| alive | 2163 | 89.0% |
| dead | 267 | 11.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2430 | 20.4% |
| A | 2163 | 18.2% |
| i | 2163 | 18.2% |
| l | 2163 | 18.2% |
| v | 2163 | 18.2% |
| D | 267 | 2.2% |
| a | 267 | 2.2% |
| d | 267 | 2.2% |
Sequence number
Categorical
High correlation  Imbalance 
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
| Value | Count |
|---|---|
| One primary only | 1784 |
| 2nd of 2 or more primaries | 417 |
| 1st of 2 or more primaries | 116 |
| 3rd of 3 or more primaries | 90 |
| 4th of 4 or more primaries | 22 |
Length
| Max length | 26 |
|---|---|
| Median length | 16 |
| Mean length | 18.658436 |
| Min length | 16 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | One primary only |
|---|---|
| 2nd row | One primary only |
| 3rd row | 2nd of 2 or more primaries |
| 4th row | 2nd of 2 or more primaries |
| 5th row | 2nd of 2 or more primaries |
Common Values
| Value | Count | Frequency (%) |
| One primary only | 1784 | 73.4% |
| 2nd of 2 or more primaries | 417 | 17.2% |
| 1st of 2 or more primaries | 116 | 4.8% |
| 3rd of 3 or more primaries | 90 | 3.7% |
| 4th of 4 or more primaries | 22 | 0.9% |
| 5th of 5 or more primaries | 1 | < 0.1% |
Words
| Value | Count | Frequency (%) |
| one | 1784 | 19.3% |
| primary | 1784 | 19.3% |
| only | 1784 | 19.3% |
| of | 646 | 7.0% |
| more | 646 | 7.0% |
| or | 646 | 7.0% |
| primaries | 646 | 7.0% |
| 2 | 533 | 5.8% |
| 2nd | 417 | 4.5% |
| 1st | 116 | 1.3% |
| Other values (6) | 226 | 2.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 6798 | 15.0% |
| r | 6242 | 13.8% |
| n | 3985 | 8.8% |
| o | 3722 | 8.2% |
| y | 3568 | 7.9% |
| e | 3076 | 6.8% |
| m | 3076 | 6.8% |
| i | 3076 | 6.8% |
| a | 2430 | 5.4% |
| p | 2430 | 5.4% |
| Other values (12) | 6937 | 15.3% |
First malignant primary indicator
Boolean
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 21.4 KiB |
| Value | Count |
|---|---|
| True | 1931 |
| False | 499 |
| Value | Count | Frequency (%) |
| True | 1931 | 79.5% |
| False | 499 | 20.5% |
Primary by international rules
Boolean
Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 21.4 KiB |
| Value | Count |
|---|---|
| True | 2421 |
| False | 9 |
| Value | Count | Frequency (%) |
| True | 2421 | 99.6% |
| False | 9 | 0.4% |
Record number recode
Categorical
High correlation  Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
| Value | Count |
|---|---|
| 1 | 1878 |
| 2 | 447 |
| 3 | 79 |
| 4 | 24 |
| 5 | 2 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 1878 | 77.3% |
| 2 | 447 | 18.4% |
| 3 | 79 | 3.3% |
| 4 | 24 | 1.0% |
| 5 | 2 | 0.1% |
Total number of in situ/malignant tumors for patient
Categorical
High correlation  Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
| Value | Count |
|---|---|
| 1 | 1809 |
| 2 | 477 |
| 3 | 114 |
| 4 | 26 |
| 5 | 4 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 1809 | 74.4% |
| 2 | 477 | 19.6% |
| 3 | 114 | 4.7% |
| 4 | 26 | 1.1% |
| 5 | 4 | 0.2% |
Total number of benign/borderline tumors for patient
Categorical
Imbalance 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
| Value | Count |
|---|---|
| 0 | 2390 |
| 1 | 38 |
| 3 | 1 |
| 2 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2390 | 98.4% |
| 1 | 38 | 1.6% |
| 3 | 1 | < 0.1% |
| 2 | 1 | < 0.1% |
Age recode with single ages and 90+
Real number (ℝ)
| Distinct | 72 |
|---|---|
| Distinct (%) | 3.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 65.072016 |
| Minimum | 12 |
|---|---|
| Maximum | 90 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 38.0 KiB |
Quantile statistics
| Minimum | 12 |
|---|---|
| 5-th percentile | 41 |
| Q1 | 57 |
| median | 66 |
| Q3 | 74 |
| 95-th percentile | 85 |
| Maximum | 90 |
| Range | 78 |
| Interquartile range (IQR) | 17 |
Descriptive statistics
| Standard deviation | 13.345527 |
|---|---|
| Coefficient of variation (CV) | 0.20508857 |
| Kurtosis | 0.13248504 |
| Mean | 65.072016 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | -0.50278075 |
| Sum | 158125 |
| Variance | 178.10309 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 70 | 86 | 3.5% |
| 72 | 85 | 3.5% |
| 71 | 83 | 3.4% |
| 69 | 81 | 3.3% |
| 67 | 79 | 3.3% |
| 65 | 74 | 3.0% |
| 64 | 72 | 3.0% |
| 62 | 71 | 2.9% |
| 60 | 71 | 2.9% |
| 74 | 69 | 2.8% |
| Other values (62) | 1659 | 68.3% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 12 | 1 | < 0.1% |
| 14 | 1 | < 0.1% |
| 18 | 1 | < 0.1% |
| 22 | 1 | < 0.1% |
| 23 | 3 | 0.1% |
| 24 | 2 | 0.1% |
| 25 | 3 | 0.1% |
| 26 | 2 | 0.1% |
| 27 | 3 | 0.1% |
| 28 | 3 | 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 90 | 42 | 1.7% |
| 89 | 11 | 0.5% |
| 88 | 20 | 0.8% |
| 87 | 22 | 0.9% |
| 86 | 25 | 1.0% |
| 85 | 31 | 1.3% |
| 84 | 37 | 1.5% |
| 83 | 34 | 1.4% |
| 82 | 27 | 1.1% |
| 81 | 39 | 1.6% |
Year of follow-up recode
Categorical
High correlation  Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
| Value | Count |
|---|---|
| 2022 | 2211 |
| 2021 | 100 |
| 2020 | 65 |
| 2019 | 39 |
| 2018 | 15 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2022 |
|---|---|
| 2nd row | 2022 |
| 3rd row | 2021 |
| 4th row | 2022 |
| 5th row | 2022 |
Common Values
| Value | Count | Frequency (%) |
| 2022 | 2211 | 91.0% |
| 2021 | 100 | 4.1% |
| 2020 | 65 | 2.7% |
| 2019 | 39 | 1.6% |
| 2018 | 15 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 7017 | 72.2% |
| 0 | 2495 | 25.7% |
| 1 | 154 | 1.6% |
| 9 | 39 | 0.4% |
| 8 | 15 | 0.2% |
Patient ID
Real number (ℝ)
High correlation 
| Distinct | 2424 |
|---|---|
| Distinct (%) | 99.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12075882 |
| Minimum | 812 |
|---|---|
| Maximum | 22445878 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 38.0 KiB |
Quantile statistics
| Minimum | 812 |
|---|---|
| 5-th percentile | 1527866.1 |
| Q1 | 5966362.8 |
| median | 11394506 |
| Q3 | 17031363 |
| 95-th percentile | 22386542 |
| Maximum | 22445878 |
| Range | 22445066 |
| Interquartile range (IQR) | 11065000 |
Descriptive statistics
| Standard deviation | 7536067.2 |
|---|---|
| Coefficient of variation (CV) | 0.62405936 |
| Kurtosis | -1.398399 |
| Mean | 12075882 |
| Median Absolute Deviation (MAD) | 5589215 |
| Skewness | -0.051652279 |
| Sum | 2.9344393 × 10^10 |
| Variance | 5.679231 × 10^13 |
| Monotonicity | Increasing |
Common Values
| Value | Count | Frequency (%) |
| 11363912 | 2 | 0.1% |
| 1489143 | 2 | 0.1% |
| 5978043 | 2 | 0.1% |
| 6020990 | 2 | 0.1% |
| 1522113 | 2 | 0.1% |
| 11341792 | 2 | 0.1% |
| 641443 | 1 | < 0.1% |
| 651626 | 1 | < 0.1% |
| 22424103 | 1 | < 0.1% |
| 851907 | 1 | < 0.1% |
| Other values (2414) | 2414 | 99.3% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 812 | 1 | < 0.1% |
| 19511 | 1 | < 0.1% |
| 200360 | 1 | < 0.1% |
| 259988 | 1 | < 0.1% |
| 511662 | 1 | < 0.1% |
| 544070 | 1 | < 0.1% |
| 641443 | 1 | < 0.1% |
| 651626 | 1 | < 0.1% |
| 654059 | 1 | < 0.1% |
| 686799 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 22445878 | 1 | < 0.1% |
| 22445847 | 1 | < 0.1% |
| 22443797 | 1 | < 0.1% |
| 22443698 | 1 | < 0.1% |
| 22443682 | 1 | < 0.1% |
| 22443676 | 1 | < 0.1% |
| 22443657 | 1 | < 0.1% |
| 22443599 | 1 | < 0.1% |
| 22442404 | 1 | < 0.1% |
| 22442286 | 1 | < 0.1% |
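Patient ID is profiled as a real number, so statistics such as the mean and variance describe an identifier rather than a measurement. The more informative figures are the 2424 distinct values across 2430 rows, which means six IDs appear twice. The following is a minimal sketch of how repeated IDs could be listed using only the standard library. The helper name `repeated_ids` is hypothetical, and the short list of IDs is an illustrative subset taken from the tables above, not the full column.

```python
from collections import Counter
from typing import Dict, Iterable


def repeated_ids(ids: Iterable[int]) -> Dict[int, int]:
    """Return a mapping of each ID that occurs more than once to its count."""
    counts = Counter(ids)
    return {pid: n for pid, n in counts.items() if n > 1}


# Illustrative subset: two IDs the report lists with a count of 2,
# plus two it lists with a count of 1.
ids = [11363912, 641443, 11363912, 1489143, 651626, 1489143]
dupes = repeated_ids(ids)
print(dupes)  # {11363912: 2, 1489143: 2}
```

Whether a repeated ID represents a second tumor record for the same patient is an assumption that would need to be checked against the Record number recode and Sequence number fields.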
Type of Reporting Source
Categorical
High correlation  Imbalance 
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
| Value | Count |
|---|---|
| Hospital inpatient/outpatient or clinic | 2235 |
| Laboratory only (hospital or private) | 148 |
| Other hospital outpatient unit or surgery center (2006+) | 27 |
| Radiation treatment or medical oncology center (2006+) | 11 |
| Physicians office/private medical practitioner (LMD) | 5 |
Length
| Max length | 56 |
|---|---|
| Median length | 39 |
| Mean length | 39.117284 |
| Min length | 12 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Hospital inpatient/outpatient or clinic |
|---|---|
| 2nd row | Hospital inpatient/outpatient or clinic |
| 3rd row | Hospital inpatient/outpatient or clinic |
| 4th row | Hospital inpatient/outpatient or clinic |
| 5th row | Hospital inpatient/outpatient or clinic |
Common Values
| Value | Count | Frequency (%) |
| Hospital inpatient/outpatient or clinic | 2235 | 92.0% |
| Laboratory only (hospital or private) | 148 | 6.1% |
| Other hospital outpatient unit or surgery center (2006+) | 27 | 1.1% |
| Radiation treatment or medical oncology center (2006+) | 11 | 0.5% |
| Physicians office/private medical practitioner (LMD) | 5 | 0.2% |
| Autopsy only | 4 | 0.2% |
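The frequency column follows directly from the counts, which also makes the Imbalance flag on this variable easy to see. The snippet below is a minimal sketch with illustrative names; it only recomputes the percentages from the counts listed above and does not reproduce how the report itself scores imbalance.

```python
# Counts copied from the Common Values table for Type of Reporting Source.
counts = {
    "Hospital inpatient/outpatient or clinic": 2235,
    "Laboratory only (hospital or private)": 148,
    "Other hospital outpatient unit or surgery center (2006+)": 27,
    "Radiation treatment or medical oncology center (2006+)": 11,
    "Physicians office/private medical practitioner (LMD)": 5,
    "Autopsy only": 4,
}

total = sum(counts.values())  # should equal the number of observations

# Share of each category, as a percentage rounded to one decimal place.
freq = {label: round(100 * n / total, 1) for label, n in counts.items()}

for label, pct in freq.items():
    print(f"{label}: {pct}%")
```

The dominant category accounts for roughly nine in ten records, so any model or stratified analysis using this field has very few observations in the remaining five categories.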
Words
| Value | Count | Frequency (%) |
| or | 2421 | 24.2% |
| hospital | 2410 | 24.1% |
| inpatient/outpatient | 2235 | 22.3% |
| clinic | 2235 | 22.3% |
| only | 152 | 1.5% |
| laboratory | 148 | 1.5% |
| private | 148 | 1.5% |
| center | 38 | 0.4% |
| 2006 | 38 | 0.4% |
| other | 27 | 0.3% |
| Other values (12) | 154 | 1.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 14117 | 14.9% |
| i | 13855 | 14.6% |
| n | 9227 | 9.7% |
| o | 7599 | 8.0% |
| (space) | 7576 | 8.0% |
| a | 7415 | 7.8% |
| p | 7069 | 7.4% |
| e | 4828 | 5.1% |
| l | 4824 | 5.1% |
| c | 4550 | 4.8% |
| Other values (26) | 13995 | 14.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 95055 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 95055 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 95055 |
Marital status at diagnosis
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
| Married (including common law) | 1349 |
|---|---|
| Single (never married) | 396 |
| Widowed | 239 |
| Unknown | 200 |
| Divorced | 198 |
| Other values (2) | 48 |
Length
| Max length | 30 |
|---|---|
| Median length | 30 |
| Mean length | 22.514815 |
| Min length | 7 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Married (including common law) |
|---|---|
| 2nd row | Single (never married) |
| 3rd row | Unknown |
| 4th row | Married (including common law) |
| 5th row | Single (never married) |
Common Values
| Value | Count | Frequency (%) |
| Married (including common law) | 1349 | 55.5% |
| Single (never married) | 396 | 16.3% |
| Widowed | 239 | 9.8% |
| Unknown | 200 | 8.2% |
| Divorced | 198 | 8.1% |
| Separated | 26 | 1.1% |
| Unmarried or Domestic Partner | 22 | 0.9% |
Words
| Value | Count | Frequency (%) |
| married | 1745 | 23.8% |
| including | 1349 | 18.4% |
| common | 1349 | 18.4% |
| law | 1349 | 18.4% |
| single | 396 | 5.4% |
| never | 396 | 5.4% |
| widowed | 239 | 3.3% |
| unknown | 200 | 2.7% |
| divorced | 198 | 2.7% |
| separated | 26 | 0.4% |
| Other values (4) | 88 | 1.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 5483 | 10.0% |
| i | 5320 | 9.7% |
| (space) | 4905 | 9.0% |
| r | 4220 | 7.7% |
| d | 3818 | 7.0% |
| e | 3488 | 6.4% |
| o | 3379 | 6.2% |
| a | 3190 | 5.8% |
| m | 3138 | 5.7% |
| l | 3094 | 5.7% |
| Other values (17) | 14676 | 26.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 54711 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 54711 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 54711 |
CoC Accredited Flag (2018+)
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
| ANALYTIC abstract from facility WITH CoC accreditation | 1802 |
|---|---|
| Abstract from facility WITHOUT CoC accreditation | 449 |
| NON-ANALYTIC abstract from facility WITH CoC accreditation | 179 |
Length
| Max length | 58 |
|---|---|
| Median length | 54 |
| Mean length | 53.186008 |
| Min length | 48 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ANALYTIC abstract from facility WITH CoC accreditation |
|---|---|
| 2nd row | ANALYTIC abstract from facility WITH CoC accreditation |
| 3rd row | ANALYTIC abstract from facility WITH CoC accreditation |
| 4th row | Abstract from facility WITHOUT CoC accreditation |
| 5th row | ANALYTIC abstract from facility WITH CoC accreditation |
Common Values
| Value | Count | Frequency (%) |
| ANALYTIC abstract from facility WITH CoC accreditation | 1802 | 74.2% |
| Abstract from facility WITHOUT CoC accreditation | 449 | 18.5% |
| NON-ANALYTIC abstract from facility WITH CoC accreditation | 179 | 7.4% |
Words
| Value | Count | Frequency (%) |
| abstract | 2430 | 14.7% |
| accreditation | 2430 | 14.7% |
| from | 2430 | 14.7% |
| facility | 2430 | 14.7% |
| coc | 2430 | 14.7% |
| with | 1981 | 12.0% |
| analytic | 1802 | 10.9% |
| without | 449 | 2.7% |
| non-analytic | 179 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 14131 | 10.9% |
| t | 12150 | 9.4% |
| a | 11701 | 9.1% |
| c | 9720 | 7.5% |
| i | 9720 | 7.5% |
| o | 7290 | 5.6% |
| r | 7290 | 5.6% |
| C | 6841 | 5.3% |
| T | 4860 | 3.8% |
| f | 4860 | 3.8% |
| Other values (18) | 40679 | 31.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 129242 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 129242 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 129242 |
Median household income inflation adj to 2023
Categorical
High correlation 
| Distinct | 16 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
| $120,000+ | 641 |
|---|---|
| $95,000 - $99,999 | 259 |
| $100,000 - $109,999 | 254 |
| $85,000 - $89,999 | 244 |
| $80,000 - $84,999 | 207 |
| Other values (11) | 825 |
Length
| Max length | 19 |
|---|---|
| Median length | 17 |
| Mean length | 15.182716 |
| Min length | 9 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | $120,000+ |
|---|---|
| 2nd row | $120,000+ |
| 3rd row | $120,000+ |
| 4th row | $120,000+ |
| 5th row | $120,000+ |
Common Values
| Value | Count | Frequency (%) |
| $120,000+ | 641 | 26.4% |
| $95,000 - $99,999 | 259 | 10.7% |
| $100,000 - $109,999 | 254 | 10.5% |
| $85,000 - $89,999 | 244 | 10.0% |
| $80,000 - $84,999 | 207 | 8.5% |
| $75,000 - $79,999 | 202 | 8.3% |
| $90,000 - $94,999 | 190 | 7.8% |
| $110,000 - $119,999 | 106 | 4.4% |
| $65,000 - $69,999 | 91 | 3.7% |
| $70,000 - $74,999 | 73 | 3.0% |
| Other values (6) | 163 | 6.7% |
Words
| Value | Count | Frequency (%) |
| (empty) | 1789 | 29.8% |
| 120,000 | 641 | 10.7% |
| 95,000 | 259 | 4.3% |
| 99,999 | 259 | 4.3% |
| 100,000 | 254 | 4.2% |
| 109,999 | 254 | 4.2% |
| 85,000 | 244 | 4.1% |
| 89,999 | 244 | 4.1% |
| 80,000 | 207 | 3.4% |
| 84,999 | 207 | 3.4% |
| Other values (20) | 1649 | 27.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 9364 | 25.4% |
| 9 | 7486 | 20.3% |
| $ | 4218 | 11.4% |
| , | 4218 | 11.4% |
| (space) | 3577 | 9.7% |
| - | 1788 | 4.8% |
| 1 | 1573 | 4.3% |
| 5 | 1040 | 2.8% |
| 8 | 902 | 2.4% |
| + | 641 | 1.7% |
| Other values (5) | 2087 | 5.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 36894 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 36894 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 36894 |
Rural-Urban Continuum Code
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.0 KiB |
| Counties in metropolitan areas ge 1 million pop | 1592 |
|---|---|
| Counties in metropolitan areas of 250,000 to 1 million pop | 502 |
| Counties in metropolitan areas of lt 250 thousand pop | 133 |
| Nonmetropolitan counties adjacent to a metropolitan area | 106 |
| Nonmetropolitan counties not adjacent to a metropolitan area | 97 |
Length
| Max length | 60 |
|---|---|
| Median length | 47 |
| Mean length | 50.512346 |
| Min length | 47 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Counties in metropolitan areas ge 1 million pop |
|---|---|
| 2nd row | Counties in metropolitan areas ge 1 million pop |
| 3rd row | Counties in metropolitan areas ge 1 million pop |
| 4th row | Counties in metropolitan areas ge 1 million pop |
| 5th row | Counties in metropolitan areas ge 1 million pop |
Common Values
| Value | Count | Frequency (%) |
| Counties in metropolitan areas ge 1 million pop | 1592 | 65.5% |
| Counties in metropolitan areas of 250,000 to 1 million pop | 502 | 20.7% |
| Counties in metropolitan areas of lt 250 thousand pop | 133 | 5.5% |
| Nonmetropolitan counties adjacent to a metropolitan area | 106 | 4.4% |
| Nonmetropolitan counties not adjacent to a metropolitan area | 97 | 4.0% |
Words
| Value | Count | Frequency (%) |
| counties | 2430 | 11.9% |
| metropolitan | 2430 | 11.9% |
| in | 2227 | 10.9% |
| areas | 2227 | 10.9% |
| pop | 2227 | 10.9% |
| million | 2094 | 10.2% |
| 1 | 2094 | 10.2% |
| ge | 1592 | 7.8% |
| to | 705 | 3.4% |
| of | 635 | 3.1% |
| Other values (9) | 1810 | 8.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 18041 | 14.7% |
| o | 13790 | 11.2% |
| i | 11478 | 9.4% |
| n | 10020 | 8.2% |
| e | 9288 | 7.6% |
| t | 8967 | 7.3% |
| a | 8235 | 6.7% |
| p | 7087 | 5.8% |
| l | 6954 | 5.7% |
| r | 5063 | 4.1% |
| Other values (16) | 23822 | 19.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 122745 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 122745 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 122745 |
Correlations
| | AJCC ID (2018+) | Age recode with single ages and 90+ | COD to site rec KM | COD to site recode | COD to site recode ICD-O-3 2023 Revision | COD to site recode ICD-O-3 2023 Revision Expanded (1999+) | Chemotherapy recode (yes, no/unk) | CoC Accredited Flag (2018+) | Derived EOD 2018 M Recode (2018+) | Derived EOD 2018 N Recode (2018+) | Derived EOD 2018 Stage Group Recode (2018+) | Derived EOD 2018 T Recode (2018+) | Derived Summary Grade 2018 (2018+) | Diagnostic Confirmation | EOD Mets Recode (2018+) | EOD Primary Tumor Recode (2018+) | EOD Regional Nodes Recode (2018+) | First malignant primary indicator | Grade Clinical (2018+) | Grade Pathological (2018+) | Marital status at diagnosis | Median household income inflation adj to 2023 | Mets at DX-Distant LN (2016+) | Mets at DX-Other (2016+) | PRCDA 2020 | Patient ID | Primary Site | Primary Site - labeled | Primary by international rules | RX Summ--Scope Reg LN Sur (2003+) | RX Summ--Surg Oth Reg/Dis (2003+) | RX Summ--Surg Prim Site (1998+) | RX Summ--Surg/Rad Seq | RX Summ--Systemic/Sur Seq (2007+) | Race recode (White, Black, Other) | Radiation recode | Reason no cancer-directed surgery | Record number recode | Regional nodes examined (1988+) | Regional nodes positive (1988+) | Rural-Urban Continuum Code | SEER Combined Mets at DX-bone (2010+) | SEER Combined Mets at DX-brain (2010+) | SEER Combined Mets at DX-liver (2010+) | SEER Combined Mets at DX-lung (2010+) | SEER cause-specific death classification | SEER other cause of death classification | Sequence number | Sex | Site recode ICD-O-3 2023 Revision Expanded | Survival months flag | Total number of benign/borderline tumors for patient | Total number of in situ/malignant tumors for patient | Tumor Size Summary (2016+) | Type of Reporting Source | Vital status recode (study cutoff used) | Year of diagnosis | Year of follow-up recode |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AJCC ID (2018+) | 1.000 | 0.075 | 0.124 | 0.124 | 0.131 | 0.132 | 0.194 | 0.092 | 0.712 | 0.707 | 0.846 | 0.713 | 0.141 | 0.108 | 0.178 | 0.280 | 0.165 | 0.012 | 0.050 | 0.138 | 0.041 | 0.076 | 0.118 | 0.165 | 0.012 | 0.092 | 0.691 | 0.991 | 0.000 | 0.349 | 0.070 | 0.391 | 0.000 | 0.165 | 0.092 | 0.000 | 0.193 | 0.000 | 0.209 | 0.208 | 0.055 | 0.135 | 0.118 | 0.174 | 0.135 | 0.114 | 0.025 | 0.013 | 0.032 | 0.992 | 0.067 | 0.029 | 0.022 | 0.260 | 0.050 | 0.135 | 0.051 | 0.111 |
| Age recode with single ages and 90+ | 0.075 | 1.000 | 0.053 | 0.053 | 0.052 | 0.050 | 0.086 | 0.075 | 0.044 | 0.106 | 0.058 | 0.082 | 0.041 | 0.087 | 0.000 | 0.070 | 0.115 | 0.199 | 0.016 | 0.047 | 0.156 | 0.000 | 0.000 | 0.009 | 0.017 | -0.108 | -0.090 | 0.000 | 0.024 | 0.000 | 0.020 | 0.093 | 0.062 | 0.082 | 0.064 | 0.102 | 0.086 | 0.104 | 0.071 | 0.159 | 0.000 | 0.000 | 0.000 | 0.017 | 0.000 | 0.112 | 0.127 | 0.097 | 0.095 | 0.016 | 0.000 | 0.000 | 0.100 | 0.028 | 0.000 | 0.257 | 0.004 | 0.094 |
| COD to site rec KM | 0.124 | 0.053 | 1.000 | 1.000 | 0.975 | 0.975 | 0.095 | 0.106 | 0.270 | 0.142 | 0.117 | 0.114 | 0.122 | 0.105 | 0.326 | 0.143 | 0.101 | 0.173 | 0.095 | 0.052 | 0.000 | 0.000 | 0.349 | 0.162 | 0.073 | 0.000 | 0.131 | 0.065 | 0.000 | 0.000 | 0.026 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.191 | 0.075 | 0.056 | 0.000 | 0.058 | 0.145 | 0.216 | 0.213 | 0.173 | 0.937 | 0.940 | 0.104 | 0.081 | 0.066 | 0.220 | 0.000 | 0.140 | 0.000 | 0.176 | 0.993 | 0.128 | 0.411 |
| COD to site recode | 0.124 | 0.053 | 1.000 | 1.000 | 0.975 | 0.975 | 0.095 | 0.106 | 0.270 | 0.142 | 0.117 | 0.114 | 0.122 | 0.105 | 0.326 | 0.143 | 0.101 | 0.173 | 0.095 | 0.052 | 0.000 | 0.000 | 0.349 | 0.162 | 0.073 | 0.000 | 0.131 | 0.065 | 0.000 | 0.000 | 0.026 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.191 | 0.075 | 0.056 | 0.000 | 0.058 | 0.145 | 0.216 | 0.213 | 0.173 | 0.937 | 0.940 | 0.104 | 0.081 | 0.066 | 0.220 | 0.000 | 0.140 | 0.000 | 0.176 | 0.993 | 0.128 | 0.411 |
| COD to site recode ICD-O-3 2023 Revision | 0.131 | 0.052 | 0.975 | 0.975 | 1.000 | 1.000 | 0.092 | 0.109 | 0.271 | 0.149 | 0.113 | 0.135 | 0.126 | 0.119 | 0.326 | 0.159 | 0.105 | 0.174 | 0.096 | 0.048 | 0.045 | 0.000 | 0.360 | 0.164 | 0.060 | 0.000 | 0.125 | 0.055 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.193 | 0.070 | 0.051 | 0.000 | 0.064 | 0.158 | 0.211 | 0.211 | 0.309 | 0.937 | 0.940 | 0.099 | 0.077 | 0.060 | 0.228 | 0.000 | 0.136 | 0.000 | 0.176 | 0.992 | 0.126 | 0.422 |
| COD to site recode ICD-O-3 2023 Revision Expanded (1999+) | 0.132 | 0.050 | 0.975 | 0.975 | 1.000 | 1.000 | 0.092 | 0.109 | 0.274 | 0.150 | 0.113 | 0.135 | 0.125 | 0.117 | 0.326 | 0.158 | 0.104 | 0.173 | 0.094 | 0.045 | 0.046 | 0.000 | 0.360 | 0.171 | 0.056 | 0.000 | 0.125 | 0.051 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.195 | 0.072 | 0.047 | 0.000 | 0.062 | 0.157 | 0.210 | 0.210 | 0.309 | 0.937 | 0.940 | 0.098 | 0.091 | 0.058 | 0.227 | 0.000 | 0.141 | 0.000 | 0.180 | 0.992 | 0.127 | 0.423 |
| Chemotherapy recode (yes, no/unk) | 0.194 | 0.086 | 0.095 | 0.095 | 0.092 | 0.092 | 1.000 | 0.149 | 0.321 | 0.170 | 0.572 | 0.545 | 0.410 | 0.067 | 0.315 | 0.312 | 0.170 | 0.088 | 0.200 | 0.448 | 0.136 | 0.064 | 0.078 | 0.214 | 0.056 | 0.083 | 0.114 | 0.251 | 0.000 | 0.189 | 0.129 | 0.251 | 0.028 | 0.723 | 0.034 | 0.037 | 0.168 | 0.096 | 0.136 | 0.138 | 0.016 | 0.071 | 0.059 | 0.241 | 0.065 | 0.061 | 0.000 | 0.087 | 0.059 | 0.205 | 0.092 | 0.000 | 0.079 | 0.543 | 0.174 | 0.041 | 0.189 | 0.020 |
| CoC Accredited Flag (2018+) | 0.092 | 0.075 | 0.106 | 0.106 | 0.109 | 0.109 | 0.149 | 1.000 | 0.094 | 0.092 | 0.161 | 0.222 | 0.142 | 0.060 | 0.016 | 0.205 | 0.255 | 0.018 | 0.042 | 0.144 | 0.264 | 0.157 | 0.105 | 0.095 | 0.059 | 0.223 | 0.127 | 0.167 | 0.052 | 0.387 | 0.137 | 0.237 | 0.000 | 0.124 | 0.146 | 0.000 | 0.260 | 0.032 | 0.156 | 0.166 | 0.107 | 0.102 | 0.091 | 0.098 | 0.097 | 0.060 | 0.032 | 0.000 | 0.010 | 0.119 | 0.187 | 0.040 | 0.000 | 0.212 | 0.383 | 0.090 | 0.089 | 0.074 |
| Derived EOD 2018 M Recode (2018+) | 0.712 | 0.044 | 0.270 | 0.270 | 0.271 | 0.274 | 0.321 | 0.094 | 1.000 | 0.716 | 0.957 | 0.755 | 0.217 | 0.102 | 0.671 | 0.388 | 0.210 | 0.030 | 0.104 | 0.216 | 0.051 | 0.084 | 0.169 | 0.468 | 0.000 | 0.103 | 0.686 | 0.711 | 0.000 | 0.342 | 0.197 | 0.355 | 0.000 | 0.090 | 0.014 | 0.048 | 0.279 | 0.000 | 0.150 | 0.181 | 0.063 | 0.150 | 0.121 | 0.508 | 0.149 | 0.208 | 0.053 | 0.000 | 0.086 | 0.704 | 0.071 | 0.000 | 0.001 | 0.356 | 0.061 | 0.269 | 0.138 | 0.187 |
| Derived EOD 2018 N Recode (2018+) | 0.707 | 0.106 | 0.142 | 0.142 | 0.149 | 0.150 | 0.170 | 0.092 | 0.716 | 1.000 | 0.777 | 0.713 | 0.142 | 0.094 | 0.192 | 0.278 | 0.702 | 0.026 | 0.037 | 0.151 | 0.044 | 0.058 | 0.125 | 0.170 | 0.000 | 0.082 | 0.683 | 0.702 | 0.017 | 0.346 | 0.089 | 0.290 | 0.000 | 0.093 | 0.010 | 0.049 | 0.204 | 0.000 | 0.188 | 0.446 | 0.046 | 0.138 | 0.132 | 0.181 | 0.135 | 0.137 | 0.064 | 0.000 | 0.032 | 0.698 | 0.070 | 0.000 | 0.022 | 0.325 | 0.056 | 0.162 | 0.045 | 0.125 |
| Derived EOD 2018 Stage Group Recode (2018+) | 0.846 | 0.058 | 0.117 | 0.117 | 0.113 | 0.113 | 0.572 | 0.161 | 0.957 | 0.777 | 1.000 | 0.561 | 0.474 | 0.101 | 0.615 | 0.313 | 0.312 | 0.073 | 0.170 | 0.479 | 0.091 | 0.042 | 0.165 | 0.354 | 0.071 | 0.073 | 0.400 | 0.412 | 0.040 | 0.285 | 0.115 | 0.241 | 0.000 | 0.256 | 0.048 | 0.047 | 0.211 | 0.010 | 0.135 | 0.163 | 0.036 | 0.151 | 0.126 | 0.471 | 0.150 | 0.210 | 0.073 | 0.000 | 0.120 | 0.399 | 0.064 | 0.000 | 0.000 | 0.371 | 0.100 | 0.284 | 0.115 | 0.129 |
| Derived EOD 2018 T Recode (2018+) | 0.713 | 0.082 | 0.114 | 0.114 | 0.135 | 0.135 | 0.545 | 0.222 | 0.755 | 0.713 | 0.561 | 1.000 | 0.196 | 0.066 | 0.296 | 0.569 | 0.314 | 0.096 | 0.066 | 0.219 | 0.101 | 0.041 | 0.167 | 0.204 | 0.056 | 0.051 | 0.394 | 0.429 | 0.057 | 0.352 | 0.092 | 0.268 | 0.000 | 0.208 | 0.037 | 0.000 | 0.212 | 0.041 | 0.166 | 0.176 | 0.037 | 0.230 | 0.155 | 0.271 | 0.174 | 0.160 | 0.049 | 0.036 | 0.112 | 0.409 | 0.107 | 0.000 | 0.034 | 0.732 | 0.167 | 0.215 | 0.092 | 0.100 |
| Derived Summary Grade 2018 (2018+) | 0.141 | 0.041 | 0.122 | 0.122 | 0.126 | 0.125 | 0.410 | 0.142 | 0.217 | 0.142 | 0.474 | 0.196 | 1.000 | 0.093 | 0.191 | 0.204 | 0.145 | 0.057 | 0.661 | 0.858 | 0.059 | 0.018 | 0.072 | 0.123 | 0.075 | 0.028 | 0.083 | 0.162 | 0.050 | 0.216 | 0.071 | 0.235 | 0.000 | 0.299 | 0.027 | 0.000 | 0.216 | 0.046 | 0.233 | 0.098 | 0.000 | 0.075 | 0.062 | 0.165 | 0.064 | 0.142 | 0.076 | 0.042 | 0.082 | 0.108 | 0.039 | 0.000 | 0.049 | 0.195 | 0.069 | 0.214 | 0.081 | 0.099 |
| Diagnostic Confirmation | 0.108 | 0.087 | 0.105 | 0.105 | 0.119 | 0.117 | 0.067 | 0.060 | 0.102 | 0.094 | 0.101 | 0.066 | 0.093 | 1.000 | 0.031 | 0.065 | 0.026 | 0.061 | 0.039 | 0.085 | 0.023 | 0.000 | 0.073 | 0.088 | 0.041 | 0.025 | 0.058 | 0.068 | 0.000 | 0.000 | 0.000 | 0.104 | 0.000 | 0.000 | 0.078 | 0.000 | 0.161 | 0.000 | 0.000 | 0.000 | 0.003 | 0.071 | 0.145 | 0.085 | 0.071 | 0.077 | 0.069 | 0.015 | 0.000 | 0.059 | 0.231 | 0.000 | 0.000 | 0.000 | 0.056 | 0.108 | 0.017 | 0.042 |
| EOD Mets Recode (2018+) | 0.178 | 0.000 | 0.326 | 0.326 | 0.326 | 0.326 | 0.315 | 0.016 | 0.671 | 0.192 | 0.615 | 0.296 | 0.191 | 0.031 | 1.000 | 0.333 | 0.158 | 0.020 | 0.096 | 0.191 | 0.000 | 0.055 | 0.192 | 0.465 | 0.016 | 0.078 | 0.176 | 0.292 | 0.000 | 0.037 | 0.174 | 0.235 | 0.000 | 0.054 | 0.014 | 0.037 | 0.328 | 0.000 | 0.000 | 0.042 | 0.061 | 0.104 | 0.051 | 0.535 | 0.102 | 0.204 | 0.048 | 0.000 | 0.094 | 0.194 | 0.000 | 0.000 | 0.000 | 0.422 | 0.000 | 0.260 | 0.135 | 0.172 |
| EOD Primary Tumor Recode (2018+) | 0.280 | 0.070 | 0.143 | 0.143 | 0.159 | 0.158 | 0.312 | 0.205 | 0.388 | 0.278 | 0.313 | 0.569 | 0.204 | 0.065 | 0.333 | 1.000 | 0.371 | 0.051 | 0.089 | 0.200 | 0.129 | 0.050 | 0.173 | 0.226 | 0.039 | 0.057 | 0.208 | 0.238 | 0.065 | 0.412 | 0.134 | 0.277 | 0.063 | 0.129 | 0.044 | 0.019 | 0.258 | 0.008 | 0.208 | 0.217 | 0.033 | 0.258 | 0.162 | 0.295 | 0.168 | 0.162 | 0.033 | 0.000 | 0.070 | 0.218 | 0.106 | 0.000 | 0.029 | 0.463 | 0.187 | 0.209 | 0.076 | 0.102 |
| EOD Regional Nodes Recode (2018+) | 0.165 | 0.115 | 0.101 | 0.101 | 0.105 | 0.104 | 0.170 | 0.255 | 0.210 | 0.702 | 0.312 | 0.314 | 0.145 | 0.026 | 0.158 | 0.371 | 1.000 | 0.012 | 0.010 | 0.151 | 0.211 | 0.017 | 0.208 | 0.171 | 0.028 | 0.087 | 0.133 | 0.171 | 0.056 | 0.483 | 0.154 | 0.215 | 0.000 | 0.087 | 0.069 | 0.073 | 0.227 | 0.000 | 0.229 | 0.425 | 0.037 | 0.219 | 0.183 | 0.221 | 0.191 | 0.121 | 0.077 | 0.000 | 0.031 | 0.138 | 0.164 | 0.000 | 0.038 | 0.371 | 0.295 | 0.146 | 0.026 | 0.093 |
| First malignant primary indicator | 0.012 | 0.199 | 0.173 | 0.173 | 0.174 | 0.173 | 0.088 | 0.018 | 0.030 | 0.026 | 0.073 | 0.096 | 0.057 | 0.061 | 0.020 | 0.051 | 0.012 | 1.000 | 0.046 | 0.045 | 0.041 | 0.000 | 0.024 | 0.035 | 0.000 | 0.093 | 0.021 | 0.000 | 0.110 | 0.060 | 0.000 | 0.103 | 0.000 | 0.118 | 0.075 | 0.000 | 0.036 | 0.882 | 0.051 | 0.042 | 0.000 | 0.028 | 0.033 | 0.040 | 0.039 | 0.000 | 0.138 | 0.962 | 0.000 | 0.052 | 0.045 | 0.028 | 0.869 | 0.000 | 0.056 | 0.078 | 0.000 | 0.040 |
| Grade Clinical (2018+) | 0.050 | 0.016 | 0.095 | 0.095 | 0.096 | 0.094 | 0.200 | 0.042 | 0.104 | 0.037 | 0.170 | 0.066 | 0.661 | 0.039 | 0.096 | 0.089 | 0.010 | 0.046 | 1.000 | 0.284 | 0.027 | 0.027 | 0.059 | 0.060 | 0.072 | 0.029 | 0.049 | 0.181 | 0.000 | 0.000 | 0.018 | 0.101 | 0.000 | 0.071 | 0.000 | 0.000 | 0.046 | 0.096 | 0.410 | 0.000 | 0.018 | 0.041 | 0.000 | 0.051 | 0.000 | 0.082 | 0.008 | 0.092 | 0.022 | 0.125 | 0.000 | 0.000 | 0.098 | 0.130 | 0.000 | 0.086 | 0.055 | 0.052 |
| Grade Pathological (2018+) | 0.138 | 0.047 | 0.052 | 0.052 | 0.048 | 0.045 | 0.448 | 0.144 | 0.216 | 0.151 | 0.479 | 0.219 | 0.858 | 0.085 | 0.191 | 0.200 | 0.151 | 0.045 | 0.284 | 1.000 | 0.058 | 0.017 | 0.074 | 0.111 | 0.049 | 0.000 | 0.080 | 0.107 | 0.050 | 0.223 | 0.085 | 0.303 | 0.000 | 0.362 | 0.018 | 0.020 | 0.291 | 0.043 | 0.244 | 0.106 | 0.005 | 0.070 | 0.053 | 0.173 | 0.065 | 0.139 | 0.067 | 0.037 | 0.077 | 0.095 | 0.021 | 0.000 | 0.043 | 0.213 | 0.062 | 0.217 | 0.066 | 0.076 |
| Marital status at diagnosis | 0.041 | 0.156 | 0.000 | 0.000 | 0.045 | 0.046 | 0.136 | 0.264 | 0.051 | 0.044 | 0.091 | 0.101 | 0.059 | 0.023 | 0.000 | 0.129 | 0.211 | 0.041 | 0.027 | 0.058 | 1.000 | 0.024 | 0.131 | 0.104 | 0.029 | 0.068 | 0.000 | 0.043 | 0.000 | 0.247 | 0.074 | 0.069 | 0.103 | 0.039 | 0.175 | 0.050 | 0.090 | 0.028 | 0.129 | 0.120 | 0.049 | 0.116 | 0.113 | 0.111 | 0.113 | 0.021 | 0.033 | 0.012 | 0.231 | 0.015 | 0.156 | 0.023 | 0.037 | 0.000 | 0.263 | 0.075 | 0.028 | 0.015 |
| Median household income inflation adj to 2023 | 0.076 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.064 | 0.157 | 0.084 | 0.058 | 0.042 | 0.041 | 0.018 | 0.000 | 0.055 | 0.050 | 0.017 | 0.000 | 0.027 | 0.017 | 0.024 | 1.000 | 0.016 | 0.059 | 0.436 | 0.514 | 0.042 | 0.045 | 0.000 | 0.134 | 0.024 | 0.054 | 0.000 | 0.057 | 0.296 | 0.061 | 0.041 | 0.000 | 0.052 | 0.041 | 0.478 | 0.038 | 0.065 | 0.053 | 0.127 | 0.000 | 0.000 | 0.000 | 0.000 | 0.014 | 0.000 | 0.000 | 0.015 | 0.000 | 0.065 | 0.000 | 0.126 | 0.025 |
| Mets at DX-Distant LN (2016+) | 0.118 | 0.000 | 0.349 | 0.349 | 0.360 | 0.360 | 0.078 | 0.105 | 0.169 | 0.125 | 0.165 | 0.167 | 0.072 | 0.073 | 0.192 | 0.173 | 0.208 | 0.024 | 0.059 | 0.074 | 0.131 | 0.016 | 1.000 | 0.640 | 0.015 | 0.042 | 0.135 | 0.155 | 0.000 | 0.211 | 0.162 | 0.205 | 0.000 | 0.006 | 0.079 | 0.000 | 0.233 | 0.000 | 0.063 | 0.111 | 0.000 | 0.676 | 0.649 | 0.632 | 0.691 | 0.128 | 0.018 | 0.000 | 0.005 | 0.130 | 0.125 | 0.000 | 0.000 | 0.068 | 0.169 | 0.162 | 0.055 | 0.109 |
| Mets at DX-Other (2016+) | 0.165 | 0.009 | 0.162 | 0.162 | 0.164 | 0.171 | 0.214 | 0.095 | 0.468 | 0.170 | 0.354 | 0.204 | 0.123 | 0.088 | 0.465 | 0.226 | 0.171 | 0.035 | 0.060 | 0.111 | 0.104 | 0.059 | 0.640 | 1.000 | 0.000 | 0.061 | 0.141 | 0.159 | 0.000 | 0.205 | 0.194 | 0.209 | 0.026 | 0.073 | 0.065 | 0.000 | 0.196 | 0.000 | 0.000 | 0.111 | 0.026 | 0.647 | 0.626 | 0.593 | 0.651 | 0.177 | 0.017 | 0.000 | 0.062 | 0.147 | 0.101 | 0.000 | 0.000 | 0.286 | 0.121 | 0.204 | 0.073 | 0.113 |
| PRCDA 2020 | 0.012 | 0.017 | 0.073 | 0.073 | 0.060 | 0.056 | 0.056 | 0.059 | 0.000 | 0.000 | 0.071 | 0.056 | 0.075 | 0.041 | 0.016 | 0.039 | 0.028 | 0.000 | 0.072 | 0.049 | 0.029 | 0.436 | 0.015 | 0.000 | 1.000 | 0.874 | 0.007 | 0.080 | 0.000 | 0.070 | 0.034 | 0.084 | 0.000 | 0.082 | 0.319 | 0.000 | 0.053 | 0.000 | 0.054 | 0.073 | 0.432 | 0.016 | 0.000 | 0.000 | 0.000 | 0.000 | 0.022 | 0.000 | 0.000 | 0.000 | 0.051 | 0.000 | 0.021 | 0.067 | 0.020 | 0.000 | 0.149 | 0.000 |
| Patient ID | 0.092 | -0.108 | 0.000 | 0.000 | 0.000 | 0.000 | 0.083 | 0.223 | 0.103 | 0.082 | 0.073 | 0.051 | 0.028 | 0.025 | 0.078 | 0.057 | 0.087 | 0.093 | 0.029 | 0.000 | 0.068 | 0.514 | 0.042 | 0.061 | 0.874 | 1.000 | -0.004 | 0.058 | 0.019 | 0.122 | 0.057 | 0.049 | 0.000 | 0.046 | 0.365 | 0.046 | 0.077 | 0.168 | 0.000 | 0.051 | 0.392 | 0.042 | 0.076 | 0.085 | 0.054 | 0.025 | 0.000 | 0.092 | 0.020 | 0.048 | 0.072 | 0.000 | 0.090 | 0.003 | 0.058 | 0.000 | 0.084 | 0.000 |
| Primary Site | 0.691 | -0.090 | 0.131 | 0.131 | 0.125 | 0.125 | 0.114 | 0.127 | 0.686 | 0.683 | 0.400 | 0.394 | 0.083 | 0.058 | 0.176 | 0.208 | 0.133 | 0.021 | 0.049 | 0.080 | 0.000 | 0.042 | 0.135 | 0.141 | 0.007 | -0.004 | 1.000 | 0.992 | 0.000 | 0.243 | 0.042 | 0.414 | 0.000 | 0.000 | 0.000 | 0.000 | 0.122 | 0.068 | 0.171 | 0.182 | 0.052 | 0.144 | 0.123 | 0.193 | 0.146 | 0.135 | 0.000 | 0.000 | 0.020 | 0.933 | 0.039 | 0.000 | 0.000 | 0.202 | 0.030 | 0.155 | 0.047 | 0.137 |
| Primary Site - labeled | 0.991 | 0.000 | 0.065 | 0.065 | 0.055 | 0.051 | 0.251 | 0.167 | 0.711 | 0.702 | 0.412 | 0.429 | 0.162 | 0.068 | 0.292 | 0.238 | 0.171 | 0.000 | 0.181 | 0.107 | 0.043 | 0.045 | 0.155 | 0.159 | 0.080 | 0.058 | 0.992 | 1.000 | 0.000 | 0.187 | 0.000 | 0.261 | 0.000 | 0.091 | 0.124 | 0.082 | 0.148 | 0.061 | 0.152 | 0.172 | 0.070 | 0.118 | 0.300 | 0.215 | 0.169 | 0.182 | 0.094 | 0.000 | 0.080 | 0.994 | 0.000 | 0.000 | 0.073 | 0.115 | 0.000 | 0.185 | 0.075 | 0.198 |
| Primary by international rules | 0.000 | 0.024 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.052 | 0.000 | 0.017 | 0.040 | 0.057 | 0.050 | 0.000 | 0.000 | 0.065 | 0.056 | 0.110 | 0.000 | 0.050 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.019 | 0.000 | 0.000 | 1.000 | 0.000 | 0.062 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.103 | 0.112 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.022 | 0.000 | 0.000 | 0.000 | 0.117 | 0.000 | 0.000 | 0.000 | 0.000 | 0.153 | 0.000 | 0.087 | 0.000 | 0.000 | 0.000 |
| RX Summ--Scope Reg LN Sur (2003+) | 0.349 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.189 | 0.387 | 0.342 | 0.346 | 0.285 | 0.352 | 0.216 | 0.000 | 0.037 | 0.412 | 0.483 | 0.060 | 0.000 | 0.223 | 0.247 | 0.134 | 0.211 | 0.205 | 0.070 | 0.122 | 0.243 | 0.187 | 0.000 | 1.000 | 0.197 | 0.441 | 0.000 | 0.159 | 0.094 | 0.000 | 0.604 | 0.021 | 0.769 | 0.487 | 0.000 | 0.191 | 0.262 | 0.186 | 0.262 | 0.000 | 0.000 | 0.010 | 0.107 | 0.219 | 0.332 | 0.330 | 0.043 | 0.208 | 0.328 | 0.080 | 0.037 | 0.089 |
| RX Summ--Surg Oth Reg/Dis (2003+) | 0.070 | 0.020 | 0.026 | 0.026 | 0.000 | 0.000 | 0.129 | 0.137 | 0.197 | 0.089 | 0.115 | 0.092 | 0.071 | 0.000 | 0.174 | 0.134 | 0.154 | 0.000 | 0.018 | 0.085 | 0.074 | 0.024 | 0.162 | 0.194 | 0.034 | 0.057 | 0.042 | 0.000 | 0.062 | 0.197 | 1.000 | 0.351 | 0.050 | 0.117 | 0.095 | 0.000 | 0.326 | 0.000 | 0.143 | 0.099 | 0.034 | 0.162 | 0.153 | 0.184 | 0.159 | 0.113 | 0.119 | 0.000 | 0.000 | 0.015 | 0.022 | 0.000 | 0.000 | 0.337 | 0.143 | 0.058 | 0.035 | 0.042 |
| RX Summ--Surg Prim Site (1998+) | 0.391 | 0.093 | 0.000 | 0.000 | 0.000 | 0.000 | 0.251 | 0.237 | 0.355 | 0.290 | 0.241 | 0.268 | 0.235 | 0.104 | 0.235 | 0.277 | 0.215 | 0.103 | 0.101 | 0.303 | 0.069 | 0.054 | 0.205 | 0.209 | 0.084 | 0.049 | 0.414 | 0.261 | 0.000 | 0.441 | 0.351 | 1.000 | 0.536 | 0.187 | 0.091 | 0.169 | 0.510 | 0.108 | 0.210 | 0.263 | 0.089 | 0.192 | 0.175 | 0.279 | 0.190 | 0.180 | 0.081 | 0.118 | 0.071 | 0.323 | 0.000 | 0.000 | 0.066 | 0.155 | 0.126 | 0.275 | 0.059 | 0.133 |
| RX Summ--Surg/Rad Seq | 0.000 | 0.062 | 0.000 | 0.000 | 0.000 | 0.000 | 0.028 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.063 | 0.000 | 0.000 | 0.000 | 0.000 | 0.103 | 0.000 | 0.000 | 0.026 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.050 | 0.536 | 1.000 | 0.081 | 0.036 | 0.425 | 0.000 | 0.093 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.096 | 0.000 | 0.000 | 0.000 | 0.000 | 0.089 | 0.000 | 0.000 | 0.000 | 0.021 | 0.000 |
| RX Summ--Systemic/Sur Seq (2007+) | 0.165 | 0.082 | 0.000 | 0.000 | 0.000 | 0.000 | 0.723 | 0.124 | 0.090 | 0.093 | 0.256 | 0.208 | 0.299 | 0.000 | 0.054 | 0.129 | 0.087 | 0.118 | 0.071 | 0.362 | 0.039 | 0.057 | 0.006 | 0.073 | 0.082 | 0.046 | 0.000 | 0.091 | 0.000 | 0.159 | 0.117 | 0.187 | 0.081 | 1.000 | 0.047 | 0.035 | 0.122 | 0.048 | 0.222 | 0.137 | 0.070 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.044 | 0.024 | 0.095 | 0.000 | 0.000 | 0.052 | 0.299 | 0.045 | 0.071 | 0.062 | 0.046 |
| Race recode (White, Black, Other) | 0.092 | 0.064 | 0.000 | 0.000 | 0.000 | 0.000 | 0.034 | 0.146 | 0.014 | 0.010 | 0.048 | 0.037 | 0.027 | 0.078 | 0.014 | 0.044 | 0.069 | 0.075 | 0.000 | 0.018 | 0.175 | 0.296 | 0.079 | 0.065 | 0.319 | 0.365 | 0.000 | 0.124 | 0.000 | 0.094 | 0.095 | 0.091 | 0.036 | 0.047 | 1.000 | 0.038 | 0.064 | 0.036 | 0.077 | 0.082 | 0.209 | 0.075 | 0.092 | 0.085 | 0.078 | 0.030 | 0.034 | 0.041 | 0.059 | 0.084 | 0.087 | 0.000 | 0.047 | 0.000 | 0.130 | 0.014 | 0.045 | 0.000 |
| Radiation recode | 0.000 | 0.102 | 0.000 | 0.000 | 0.000 | 0.000 | 0.037 | 0.000 | 0.048 | 0.049 | 0.047 | 0.000 | 0.000 | 0.000 | 0.037 | 0.019 | 0.073 | 0.000 | 0.000 | 0.020 | 0.050 | 0.061 | 0.000 | 0.000 | 0.000 | 0.046 | 0.000 | 0.082 | 0.000 | 0.000 | 0.000 | 0.169 | 0.425 | 0.035 | 0.038 | 1.000 | 0.000 | 0.000 | 0.079 | 0.285 | 0.000 | 0.000 | 0.146 | 0.024 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.029 | 0.000 | 0.027 | 0.000 | 0.212 | 0.000 | 0.016 | 0.000 | 0.000 |
| Reason no cancer-directed surgery | 0.193 | 0.086 | 0.191 | 0.191 | 0.193 | 0.195 | 0.168 | 0.260 | 0.279 | 0.204 | 0.211 | 0.212 | 0.216 | 0.161 | 0.328 | 0.258 | 0.227 | 0.036 | 0.046 | 0.291 | 0.090 | 0.041 | 0.233 | 0.196 | 0.053 | 0.077 | 0.122 | 0.148 | 0.103 | 0.604 | 0.326 | 0.510 | 0.000 | 0.122 | 0.064 | 0.000 | 1.000 | 0.000 | 0.101 | 0.153 | 0.029 | 0.207 | 0.194 | 0.290 | 0.209 | 0.244 | 0.164 | 0.000 | 0.066 | 0.132 | 0.131 | 0.095 | 0.000 | 0.125 | 0.196 | 0.309 | 0.042 | 0.143 |
| Record number recode | 0.000 | 0.104 | 0.075 | 0.075 | 0.070 | 0.072 | 0.096 | 0.032 | 0.000 | 0.000 | 0.010 | 0.041 | 0.046 | 0.000 | 0.000 | 0.008 | 0.000 | 0.882 | 0.096 | 0.043 | 0.028 | 0.000 | 0.000 | 0.000 | 0.000 | 0.168 | 0.068 | 0.061 | 0.112 | 0.021 | 0.000 | 0.108 | 0.093 | 0.048 | 0.036 | 0.000 | 0.000 | 1.000 | 0.128 | 0.056 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.077 | 0.815 | 0.064 | 0.047 | 0.000 | 0.150 | 0.660 | 0.000 | 0.000 | 0.060 | 0.011 | 0.000 |
| Regional nodes examined (1988+) | 0.209 | 0.071 | 0.056 | 0.056 | 0.051 | 0.047 | 0.136 | 0.156 | 0.150 | 0.188 | 0.135 | 0.166 | 0.233 | 0.000 | 0.000 | 0.208 | 0.229 | 0.051 | 0.410 | 0.244 | 0.129 | 0.052 | 0.063 | 0.000 | 0.054 | 0.000 | 0.171 | 0.152 | 0.000 | 0.769 | 0.143 | 0.210 | 0.000 | 0.222 | 0.077 | 0.079 | 0.101 | 0.128 | 1.000 | 0.593 | 0.000 | 0.059 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.149 | 0.078 | 0.163 | 0.099 | 0.000 | 0.104 | 0.261 | 0.143 | 0.038 | 0.043 | 0.032 |
| Regional nodes positive (1988+) | 0.208 | 0.159 | 0.000 | 0.000 | 0.000 | 0.000 | 0.138 | 0.166 | 0.181 | 0.446 | 0.163 | 0.176 | 0.098 | 0.000 | 0.042 | 0.217 | 0.425 | 0.042 | 0.000 | 0.106 | 0.120 | 0.041 | 0.111 | 0.111 | 0.073 | 0.051 | 0.182 | 0.172 | 0.000 | 0.487 | 0.099 | 0.263 | 0.000 | 0.137 | 0.082 | 0.285 | 0.153 | 0.056 | 0.593 | 1.000 | 0.000 | 0.100 | 0.090 | 0.117 | 0.096 | 0.088 | 0.058 | 0.043 | 0.051 | 0.174 | 0.146 | 0.011 | 0.039 | 0.206 | 0.174 | 0.075 | 0.045 | 0.103 |
| Rural-Urban Continuum Code | 0.055 | 0.000 | 0.058 | 0.058 | 0.064 | 0.062 | 0.016 | 0.107 | 0.063 | 0.046 | 0.036 | 0.037 | 0.000 | 0.003 | 0.061 | 0.033 | 0.037 | 0.000 | 0.018 | 0.005 | 0.049 | 0.478 | 0.000 | 0.026 | 0.432 | 0.392 | 0.052 | 0.070 | 0.000 | 0.000 | 0.034 | 0.089 | 0.000 | 0.070 | 0.209 | 0.000 | 0.029 | 0.000 | 0.000 | 0.000 | 1.000 | 0.016 | 0.000 | 0.026 | 0.034 | 0.021 | 0.000 | 0.000 | 0.048 | 0.057 | 0.014 | 0.000 | 0.000 | 0.080 | 0.024 | 0.021 | 0.042 | 0.000 |
| SEER Combined Mets at DX-bone (2010+) | 0.135 | 0.000 | 0.145 | 0.145 | 0.158 | 0.157 | 0.071 | 0.102 | 0.150 | 0.138 | 0.151 | 0.230 | 0.075 | 0.071 | 0.104 | 0.258 | 0.219 | 0.028 | 0.041 | 0.070 | 0.116 | 0.038 | 0.676 | 0.647 | 0.016 | 0.042 | 0.144 | 0.118 | 0.000 | 0.191 | 0.162 | 0.192 | 0.000 | 0.000 | 0.075 | 0.000 | 0.207 | 0.000 | 0.059 | 0.100 | 0.016 | 1.000 | 0.661 | 0.628 | 0.683 | 0.060 | 0.058 | 0.000 | 0.000 | 0.139 | 0.122 | 0.000 | 0.010 | 0.000 | 0.166 | 0.109 | 0.091 | 0.052 |
| SEER Combined Mets at DX-brain (2010+) | 0.118 | 0.000 | 0.216 | 0.216 | 0.211 | 0.210 | 0.059 | 0.091 | 0.121 | 0.132 | 0.126 | 0.155 | 0.062 | 0.145 | 0.051 | 0.162 | 0.183 | 0.033 | 0.000 | 0.053 | 0.113 | 0.065 | 0.649 | 0.626 | 0.000 | 0.076 | 0.123 | 0.300 | 0.000 | 0.262 | 0.153 | 0.175 | 0.000 | 0.000 | 0.092 | 0.146 | 0.194 | 0.000 | 0.000 | 0.090 | 0.000 | 0.661 | 1.000 | 0.601 | 0.684 | 0.068 | 0.037 | 0.000 | 0.000 | 0.126 | 0.118 | 0.000 | 0.000 | 0.000 | 0.158 | 0.116 | 0.040 | 0.037 |
| SEER Combined Mets at DX-liver (2010+) | 0.174 | 0.017 | 0.213 | 0.213 | 0.211 | 0.210 | 0.241 | 0.098 | 0.508 | 0.181 | 0.471 | 0.271 | 0.165 | 0.085 | 0.535 | 0.295 | 0.221 | 0.040 | 0.051 | 0.173 | 0.111 | 0.053 | 0.632 | 0.593 | 0.000 | 0.085 | 0.193 | 0.215 | 0.022 | 0.186 | 0.184 | 0.279 | 0.000 | 0.000 | 0.085 | 0.024 | 0.290 | 0.000 | 0.000 | 0.117 | 0.026 | 0.628 | 0.601 | 1.000 | 0.623 | 0.148 | 0.032 | 0.000 | 0.061 | 0.194 | 0.132 | 0.000 | 0.000 | 0.240 | 0.157 | 0.194 | 0.118 | 0.133 |
| SEER Combined Mets at DX-lung (2010+) | 0.135 | 0.000 | 0.173 | 0.173 | 0.309 | 0.309 | 0.065 | 0.097 | 0.149 | 0.135 | 0.150 | 0.174 | 0.064 | 0.071 | 0.102 | 0.168 | 0.191 | 0.039 | 0.000 | 0.065 | 0.113 | 0.127 | 0.691 | 0.651 | 0.000 | 0.054 | 0.146 | 0.169 | 0.000 | 0.262 | 0.159 | 0.190 | 0.000 | 0.000 | 0.078 | 0.000 | 0.209 | 0.000 | 0.000 | 0.096 | 0.034 | 0.683 | 0.684 | 0.623 | 1.000 | 0.087 | 0.000 | 0.000 | 0.000 | 0.171 | 0.122 | 0.020 | 0.010 | 0.000 | 0.166 | 0.109 | 0.045 | 0.083 |
| SEER cause-specific death classification | 0.114 | 0.112 | 0.937 | 0.937 | 0.937 | 0.937 | 0.061 | 0.060 | 0.208 | 0.137 | 0.210 | 0.160 | 0.142 | 0.077 | 0.204 | 0.162 | 0.121 | 0.000 | 0.082 | 0.139 | 0.021 | 0.000 | 0.128 | 0.177 | 0.000 | 0.025 | 0.135 | 0.182 | 0.000 | 0.000 | 0.113 | 0.180 | 0.000 | 0.000 | 0.030 | 0.000 | 0.244 | 0.000 | 0.000 | 0.088 | 0.021 | 0.060 | 0.068 | 0.148 | 0.087 | 1.000 | 0.708 | 0.000 | 0.043 | 0.150 | 0.132 | 0.000 | 0.000 | 0.107 | 0.023 | 0.684 | 0.113 | 0.352 |
| SEER other cause of death classification | 0.025 | 0.127 | 0.940 | 0.940 | 0.940 | 0.940 | 0.000 | 0.032 | 0.053 | 0.064 | 0.073 | 0.049 | 0.076 | 0.069 | 0.048 | 0.033 | 0.077 | 0.138 | 0.008 | 0.067 | 0.033 | 0.000 | 0.018 | 0.017 | 0.022 | 0.000 | 0.000 | 0.094 | 0.000 | 0.000 | 0.119 | 0.081 | 0.000 | 0.000 | 0.034 | 0.000 | 0.164 | 0.077 | 0.000 | 0.058 | 0.000 | 0.058 | 0.037 | 0.032 | 0.000 | 0.708 | 1.000 | 0.097 | 0.038 | 0.060 | 0.167 | 0.000 | 0.102 | 0.000 | 0.116 | 0.706 | 0.095 | 0.324 |
| Sequence number | 0.013 | 0.097 | 0.104 | 0.104 | 0.099 | 0.098 | 0.087 | 0.000 | 0.000 | 0.000 | 0.000 | 0.036 | 0.042 | 0.015 | 0.000 | 0.000 | 0.000 | 0.962 | 0.092 | 0.037 | 0.012 | 0.000 | 0.000 | 0.000 | 0.000 | 0.092 | 0.000 | 0.000 | 0.117 | 0.010 | 0.000 | 0.118 | 0.096 | 0.044 | 0.041 | 0.000 | 0.000 | 0.815 | 0.149 | 0.043 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.097 | 1.000 | 0.071 | 0.000 | 0.000 | 0.063 | 0.813 | 0.000 | 0.000 | 0.075 | 0.049 | 0.010 |
| Sex | 0.032 | 0.095 | 0.081 | 0.081 | 0.077 | 0.091 | 0.059 | 0.010 | 0.086 | 0.032 | 0.120 | 0.112 | 0.082 | 0.000 | 0.094 | 0.070 | 0.031 | 0.000 | 0.022 | 0.077 | 0.231 | 0.000 | 0.005 | 0.062 | 0.000 | 0.020 | 0.020 | 0.080 | 0.000 | 0.107 | 0.000 | 0.071 | 0.000 | 0.024 | 0.059 | 0.000 | 0.066 | 0.064 | 0.078 | 0.051 | 0.048 | 0.000 | 0.000 | 0.061 | 0.000 | 0.043 | 0.038 | 0.071 | 1.000 | 0.053 | 0.022 | 0.000 | 0.057 | 0.139 | 0.028 | 0.067 | 0.031 | 0.000 |
| Site recode ICD-O-3 2023 Revision Expanded | 0.992 | 0.016 | 0.066 | 0.066 | 0.060 | 0.058 | 0.205 | 0.119 | 0.704 | 0.698 | 0.399 | 0.409 | 0.108 | 0.059 | 0.194 | 0.218 | 0.138 | 0.052 | 0.125 | 0.095 | 0.015 | 0.014 | 0.130 | 0.147 | 0.000 | 0.048 | 0.933 | 0.994 | 0.000 | 0.219 | 0.015 | 0.323 | 0.000 | 0.095 | 0.084 | 0.029 | 0.132 | 0.047 | 0.163 | 0.174 | 0.057 | 0.139 | 0.126 | 0.194 | 0.171 | 0.150 | 0.060 | 0.000 | 0.053 | 1.000 | 0.033 | 0.000 | 0.000 | 0.109 | 0.000 | 0.160 | 0.038 | 0.131 |
| Survival months flag | 0.067 | 0.000 | 0.220 | 0.220 | 0.228 | 0.227 | 0.092 | 0.187 | 0.071 | 0.070 | 0.064 | 0.107 | 0.039 | 0.231 | 0.000 | 0.106 | 0.164 | 0.045 | 0.000 | 0.021 | 0.156 | 0.000 | 0.125 | 0.101 | 0.051 | 0.072 | 0.039 | 0.000 | 0.000 | 0.332 | 0.022 | 0.000 | 0.000 | 0.000 | 0.087 | 0.000 | 0.131 | 0.000 | 0.099 | 0.146 | 0.014 | 0.122 | 0.118 | 0.132 | 0.122 | 0.132 | 0.167 | 0.000 | 0.022 | 0.033 | 1.000 | 0.020 | 0.000 | 0.000 | 0.541 | 0.127 | 0.032 | 0.095 |
| Total number of benign/borderline tumors for patient | 0.029 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.040 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.028 | 0.000 | 0.000 | 0.023 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.330 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.027 | 0.095 | 0.150 | 0.000 | 0.011 | 0.000 | 0.000 | 0.000 | 0.000 | 0.020 | 0.000 | 0.000 | 0.063 | 0.000 | 0.000 | 0.020 | 1.000 | 0.033 | 0.000 | 0.011 | 0.000 | 0.000 | 0.000 |
| Total number of in situ/malignant tumors for patient | 0.022 | 0.100 | 0.140 | 0.140 | 0.136 | 0.141 | 0.079 | 0.000 | 0.001 | 0.022 | 0.000 | 0.034 | 0.049 | 0.000 | 0.000 | 0.029 | 0.038 | 0.869 | 0.098 | 0.043 | 0.037 | 0.015 | 0.000 | 0.000 | 0.021 | 0.090 | 0.000 | 0.073 | 0.153 | 0.043 | 0.000 | 0.066 | 0.089 | 0.052 | 0.047 | 0.000 | 0.000 | 0.660 | 0.104 | 0.039 | 0.000 | 0.010 | 0.000 | 0.000 | 0.010 | 0.000 | 0.102 | 0.813 | 0.057 | 0.000 | 0.000 | 0.033 | 1.000 | 0.000 | 0.007 | 0.079 | 0.014 | 0.000 |
| Tumor Size Summary (2016+) | 0.260 | 0.028 | 0.000 | 0.000 | 0.000 | 0.000 | 0.543 | 0.212 | 0.356 | 0.325 | 0.371 | 0.732 | 0.195 | 0.000 | 0.422 | 0.463 | 0.371 | 0.000 | 0.130 | 0.213 | 0.000 | 0.000 | 0.068 | 0.286 | 0.067 | 0.003 | 0.202 | 0.115 | 0.000 | 0.208 | 0.337 | 0.155 | 0.000 | 0.299 | 0.000 | 0.212 | 0.125 | 0.000 | 0.261 | 0.206 | 0.080 | 0.000 | 0.000 | 0.240 | 0.000 | 0.107 | 0.000 | 0.000 | 0.139 | 0.109 | 0.000 | 0.000 | 0.000 | 1.000 | 0.133 | 0.220 | 0.088 | 0.078 |
| Type of Reporting Source | 0.050 | 0.000 | 0.176 | 0.176 | 0.176 | 0.180 | 0.174 | 0.383 | 0.061 | 0.056 | 0.100 | 0.167 | 0.069 | 0.056 | 0.000 | 0.187 | 0.295 | 0.056 | 0.000 | 0.062 | 0.263 | 0.065 | 0.169 | 0.121 | 0.020 | 0.058 | 0.030 | 0.000 | 0.087 | 0.328 | 0.143 | 0.126 | 0.000 | 0.045 | 0.130 | 0.000 | 0.196 | 0.000 | 0.143 | 0.174 | 0.024 | 0.166 | 0.158 | 0.157 | 0.166 | 0.023 | 0.116 | 0.000 | 0.028 | 0.000 | 0.541 | 0.011 | 0.007 | 0.133 | 1.000 | 0.114 | 0.039 | 0.028 |
| Vital status recode (study cutoff used) | 0.135 | 0.257 | 0.993 | 0.993 | 0.992 | 0.992 | 0.041 | 0.090 | 0.269 | 0.162 | 0.284 | 0.215 | 0.214 | 0.108 | 0.260 | 0.209 | 0.146 | 0.078 | 0.086 | 0.217 | 0.075 | 0.000 | 0.162 | 0.204 | 0.000 | 0.000 | 0.155 | 0.185 | 0.000 | 0.080 | 0.058 | 0.275 | 0.000 | 0.071 | 0.014 | 0.016 | 0.309 | 0.060 | 0.038 | 0.075 | 0.021 | 0.109 | 0.116 | 0.194 | 0.109 | 0.684 | 0.706 | 0.075 | 0.067 | 0.160 | 0.127 | 0.000 | 0.079 | 0.220 | 0.114 | 1.000 | 0.218 | 0.658 |
| Year of diagnosis | 0.051 | 0.004 | 0.128 | 0.128 | 0.126 | 0.127 | 0.189 | 0.089 | 0.138 | 0.045 | 0.115 | 0.092 | 0.081 | 0.017 | 0.135 | 0.076 | 0.026 | 0.000 | 0.055 | 0.066 | 0.028 | 0.126 | 0.055 | 0.073 | 0.149 | 0.084 | 0.047 | 0.075 | 0.000 | 0.037 | 0.035 | 0.059 | 0.021 | 0.062 | 0.045 | 0.000 | 0.042 | 0.011 | 0.043 | 0.045 | 0.042 | 0.091 | 0.040 | 0.118 | 0.045 | 0.113 | 0.095 | 0.049 | 0.031 | 0.038 | 0.032 | 0.000 | 0.014 | 0.088 | 0.039 | 0.218 | 1.000 | 0.208 |
| Year of follow-up recode | 0.111 | 0.094 | 0.411 | 0.411 | 0.422 | 0.423 | 0.020 | 0.074 | 0.187 | 0.125 | 0.129 | 0.100 | 0.099 | 0.042 | 0.172 | 0.102 | 0.093 | 0.040 | 0.052 | 0.076 | 0.015 | 0.025 | 0.109 | 0.113 | 0.000 | 0.000 | 0.137 | 0.198 | 0.000 | 0.089 | 0.042 | 0.133 | 0.000 | 0.046 | 0.000 | 0.000 | 0.143 | 0.000 | 0.032 | 0.103 | 0.000 | 0.052 | 0.037 | 0.133 | 0.083 | 0.352 | 0.324 | 0.010 | 0.000 | 0.131 | 0.095 | 0.000 | 0.000 | 0.078 | 0.028 | 0.658 | 0.208 | 1.000 |
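
A matrix this wide is hard to scan by eye, so the strongly associated pairs can be pulled out programmatically. The following is a minimal sketch, not the profiler's own implementation. It uses a four-variable excerpt copied from the rows above. The helper name `high_pairs` and the 0.5 cutoff are illustrative assumptions.

```python
import pandas as pd

# Excerpt of the correlation matrix above (values copied from the table).
names = [
    "Record number recode",
    "Sequence number",
    "Sex",
    "Total number of in situ/malignant tumors for patient",
]
values = [
    [1.000, 0.815, 0.064, 0.660],
    [0.815, 1.000, 0.071, 0.813],
    [0.064, 0.071, 1.000, 0.057],
    [0.660, 0.813, 0.057, 1.000],
]
corr = pd.DataFrame(values, index=names, columns=names)


def high_pairs(corr: pd.DataFrame, threshold: float = 0.5):
    """Return (var_a, var_b, value) for each unordered pair at or above threshold.

    `threshold` is an assumed cutoff chosen for illustration.
    """
    cols = list(corr.columns)
    pairs = []
    for i, a in enumerate(cols):
        # Only look right of the diagonal so each pair is reported once.
        for b in cols[i + 1:]:
            value = float(corr.loc[a, b])
            if value >= threshold:
                pairs.append((a, b, value))
    # Strongest associations first.
    return sorted(pairs, key=lambda p: p[2], reverse=True)


pairs = high_pairs(corr)
for a, b, value in pairs:
    print(f"{value:.3f}  {a}  <->  {b}")
```

In this excerpt the three tumor-count and sequencing fields are flagged against each other while `Sex` is not, which suggests those three fields carry largely redundant information.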
Missing values
Sample
| | Race recode (White, Black, Other) | Sex | Year of diagnosis | PRCDA 2020 | Site recode ICD-O-3 2023 Revision Expanded | Primary Site - labeled | Primary Site | Derived Summary Grade 2018 (2018+) | Grade Clinical (2018+) | Grade Pathological (2018+) | Diagnostic Confirmation | AJCC ID (2018+) | Derived EOD 2018 T Recode (2018+) | Derived EOD 2018 N Recode (2018+) | Derived EOD 2018 M Recode (2018+) | Derived EOD 2018 Stage Group Recode (2018+) | RX Summ--Surg Prim Site (1998+) | RX Summ--Scope Reg LN Sur (2003+) | RX Summ--Surg Oth Reg/Dis (2003+) | RX Summ--Surg/Rad Seq | Reason no cancer-directed surgery | Radiation recode | Chemotherapy recode (yes, no/unk) | RX Summ--Systemic/Sur Seq (2007+) | Time from diagnosis to treatment in days recode | EOD Primary Tumor Recode (2018+) | EOD Regional Nodes Recode (2018+) | EOD Mets Recode (2018+) | Tumor Size Over Time Recode (1988+) | Tumor Size Summary (2016+) | Regional nodes examined (1988+) | Regional nodes positive (1988+) | SEER Combined Mets at DX-bone (2010+) | SEER Combined Mets at DX-brain (2010+) | SEER Combined Mets at DX-liver (2010+) | SEER Combined Mets at DX-lung (2010+) | Mets at DX-Distant LN (2016+) | Mets at DX-Other (2016+) | COD to site recode | SEER cause-specific death classification | SEER other cause of death classification | Survival months | Survival months flag | COD to site rec KM | COD to site recode ICD-O-3 2023 Revision | COD to site recode ICD-O-3 2023 Revision Expanded (1999+) | Vital status recode (study cutoff used) | Sequence number | First malignant primary indicator | Primary by international rules | Record number recode | Total number of in situ/malignant tumors for patient | Total number of benign/borderline tumors for patient | Age recode with single ages and 90+ | Year of follow-up recode | Patient ID | Type of Reporting Source | Marital status at diagnosis | CoC Accredited Flag (2018+) | Median household income inflation adj to 2023 | Rural-Urban Continuum Code |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | White | Female | 2022 | Not PRCDA | Stomach | C16.3-Gastric antrum | 163 | L | 9 | L | Positive histology | GIST: Gastric and Omental | T2 | N0 | M0 | 1A | 30 | NaN | None; diagnosed at autopsy | No radiation and/or no surgery; unknown if surgery and/or radiation given | Surgery performed | None/Unknown | No/Unknown | No systemic therapy and/or surgical procedures | 045 | 100 | 000 | 00 | 028 | 028 | 00 | 98 | No | No | No | No | None; no lymph node metastases | None; no other metastases | Alive | Alive or dead of other cause | Alive or dead due to cancer | 0007 | Complete dates are available and there are more than 0 days of survival | Alive | Alive | Alive | Alive | One primary only | Yes | Yes | 2 | 1 | 0 | 67 | 2022 | 812 | Hospital inpatient/outpatient or clinic | Married (including common law) | ANALYTIC abstract from facility WITH CoC accreditation | $120,000+ | Counties in metropolitan areas ge 1 million pop |
| 4 | Black | Female | 2020 | Not PRCDA | Small Intestine | C17.1-Jejunum | 171 | L | 9 | L | Positive histology | GIST: Small Intestinal, Esophageal, Colorectal, Mesenteric, and Peritoneal | T2 | N0 | M0 | 1 | 30 | 4 or more regional lymph nodes removed | None; diagnosed at autopsy | No radiation and/or no surgery; unknown if surgery and/or radiation given | Surgery performed | None/Unknown | No/Unknown | No systemic therapy and/or surgical procedures | 000 | 100 | 000 | 00 | 023 | 023 | 07 | 00 | No | No | No | No | None; no lymph node metastases | None; no other metastases | Alive | Alive or dead of other cause | Alive or dead due to cancer | 0025 | Complete dates are available and there are more than 0 days of survival | Alive | Alive | Alive | Alive | One primary only | Yes | Yes | 2 | 1 | 0 | 77 | 2022 | 19511 | Hospital inpatient/outpatient or clinic | Single (never married) | ANALYTIC abstract from facility WITH CoC accreditation | $120,000+ | Counties in metropolitan areas ge 1 million pop |
| 15 | Black | Female | 2018 | Not PRCDA | Small Intestine | C17.9-Small intestine, NOS | 179 | 9 | 9 | 9 | Positive histology | GIST: Small Intestinal, Esophageal, Colorectal, Mesenteric, and Peritoneal | T2 | N0 | M0 | 99 | 30 | 1 to 3 regional lymph nodes removed | None; diagnosed at autopsy | No radiation and/or no surgery; unknown if surgery and/or radiation given | Surgery performed | None/Unknown | No/Unknown | No systemic therapy and/or surgical procedures | 000 | 100 | 000 | 00 | 035 | 035 | 02 | 00 | No | No | No | No | None; no lymph node metastases | None; no other metastases | Pancreas | Alive or dead of other cause | Dead (attributable to causes other than this cancer dx) | 0027 | Complete dates are available and there are more than 0 days of survival | Pancreas | Pancreas | Pancreas | Dead | 2nd of 2 or more primaries | No | Yes | 2 | 2 | 0 | 86 | 2021 | 200360 | Hospital inpatient/outpatient or clinic | Unknown | ANALYTIC abstract from facility WITH CoC accreditation | $120,000+ | Counties in metropolitan areas ge 1 million pop |
| 19 | White | Female | 2019 | Not PRCDA | Stomach | C16.6-Greater curvature of stomach NOS | 166 | L | 9 | L | Positive histology | GIST: Gastric and Omental | T2 | N0 | M0 | 1A | 30 | NaN | None; diagnosed at autopsy | No radiation and/or no surgery; unknown if surgery and/or radiation given | Surgery performed | None/Unknown | No/Unknown | No systemic therapy and/or surgical procedures | 025 | 100 | 000 | 00 | 022 | 022 | 00 | 98 | No | No | No | No | None; no lymph node metastases | None; no other metastases | Alive | Alive or dead of other cause | Alive or dead due to cancer | 0043 | Complete dates are available and there are more than 0 days of survival | Alive | Alive | Alive | Alive | 2nd of 2 or more primaries | No | Yes | 2 | 2 | 0 | 70 | 2022 | 259988 | Hospital inpatient/outpatient or clinic | Married (including common law) | Abstract from facility WITHOUT CoC accreditation | $120,000+ | Counties in metropolitan areas ge 1 million pop |
| 42 | White | Female | 2021 | Not PRCDA | Stomach | C16.1-Fundus of stomach | 161 | 9 | 9 | 9 | Positive histology | GIST: Gastric and Omental | T2 | N0 | M0 | 99 | 30 | NaN | None; diagnosed at autopsy | No radiation and/or no surgery; unknown if surgery and/or radiation given | Surgery performed | None/Unknown | No/Unknown | No systemic therapy and/or surgical procedures | 002 | 100 | 000 | 00 | 023 | 023 | 00 | 98 | No | No | No | No | None; no lymph node metastases | None; no other metastases | Alive | Alive or dead of other cause | Alive or dead due to cancer | 0012 | Complete dates are available and there are more than 0 days of survival | Alive | Alive | Alive | Alive | 2nd of 2 or more primaries | Yes | Yes | 2 | 2 | 0 | 77 | 2022 | 511662 | Hospital inpatient/outpatient or clinic | Single (never married) | ANALYTIC abstract from facility WITH CoC accreditation | $120,000+ | Counties in metropolitan areas ge 1 million pop |
| 46 | White | Male | 2019 | Not PRCDA | Stomach | C16.9-Stomach, NOS | 169 | H | 9 | H | Positive histology | GIST: Gastric and Omental | T4 | N0 | M1 | 4 | 30 | 4 or more regional lymph nodes removed | None; diagnosed at autopsy | No radiation and/or no surgery; unknown if surgery and/or radiation given | Surgery performed | None/Unknown | No/Unknown | No systemic therapy and/or surgical procedures | 000 | 700 | 000 | 70 | 250 | 250 | 21 | 00 | No | No | No | No | None; no lymph node metastases | Yes; distant mets in known site(s) other than bone, brain, liver, lung, dist LN | Alive | Alive or dead of other cause | Alive or dead due to cancer | 0040 | Complete dates are available and there are more than 0 days of survival | Alive | Alive | Alive | Alive | 2nd of 2 or more primaries | No | Yes | 2 | 2 | 0 | 79 | 2022 | 544070 | Hospital inpatient/outpatient or clinic | Married (including common law) | ANALYTIC abstract from facility WITH CoC accreditation | $120,000+ | Counties in metropolitan areas ge 1 million pop |
| 63 | White | Male | 2022 | Not PRCDA | Small Intestine | C17.1-Jejunum | 171 | L | 9 | L | Positive histology | GIST: Small Intestinal, Esophageal, Colorectal, Mesenteric, and Peritoneal | T4 | N0 | M0 | 3A | 30 | 4 or more regional lymph nodes removed | None; diagnosed at autopsy | No radiation and/or no surgery; unknown if surgery and/or radiation given | Surgery performed | None/Unknown | Yes | Systemic therapy after surgery | 055 | 400 | 000 | 00 | 130 | 130 | 07 | 00 | No | No | No | No | None; no lymph node metastases | None; no other metastases | Alive | Alive or dead of other cause | Alive or dead due to cancer | 0006 | Complete dates are available and there are more than 0 days of survival | Alive | Alive | Alive | Alive | 2nd of 2 or more primaries | No | Yes | 2 | 2 | 0 | 73 | 2022 | 641443 | Hospital inpatient/outpatient or clinic | Married (including common law) | ANALYTIC abstract from facility WITH CoC accreditation | $120,000+ | Counties in metropolitan areas ge 1 million pop |
| 65 | Unknown | Female | 2021 | Not PRCDA | Stomach | C16.9-Stomach, NOS | 169 | 9 | 9 | 9 | Positive histology | GIST: Gastric and Omental | TX | N0 | M0 | 99 | 00 | Unknown or not applicable | None; diagnosed at autopsy | No radiation and/or no surgery; unknown if surgery and/or radiation given | Not recommended | None/Unknown | No/Unknown | No systemic therapy and/or surgical procedures | Unable to calculate | 999 | 999 | 00 | Unknown or size unreasonable (includes any tumor sizes 401-989) | 999 | 99 | 99 | No | No | No | No | None; no lymph node metastases | None; no other metastases | Alive | Alive or dead of other cause | Alive or dead due to cancer | 0018 | Complete dates are available and there are more than 0 days of survival | Alive | Alive | Alive | Alive | One primary only | Yes | Yes | 2 | 1 | 0 | 50 | 2022 | 651626 | Laboratory only (hospital or private) | Unknown | Abstract from facility WITHOUT CoC accreditation | $120,000+ | Counties in metropolitan areas ge 1 million pop |
| 66 | Black | Male | 2019 | Not PRCDA | Stomach | C16.9-Stomach, NOS | 169 | 9 | 9 | 9 | Positive histology | GIST: Gastric and Omental | T3 | N0 | M0 | 99 | 30 | NaN | None; diagnosed at autopsy | No radiation and/or no surgery; unknown if surgery and/or radiation given | Surgery performed | None/Unknown | No/Unknown | No systemic therapy and/or surgical procedures | 000 | 400 | 000 | 00 | 094 | 094 | 00 | 98 | No | No | No | No | None; no lymph node metastases | None; no other metastases | Miscellaneous Malignant Cancer | Alive or dead of other cause | Dead (attributable to causes other than this cancer dx) | 0035 | Complete dates are available and there are more than 0 days of survival | Miscellaneous Malignant Cancer | Miscellaneous Neoplasms | Miscellaneous Neoplasms | Dead | 2nd of 2 or more primaries | No | Yes | 2 | 2 | 0 | 82 | 2022 | 654059 | Hospital inpatient/outpatient or clinic | Unknown | ANALYTIC abstract from facility WITH CoC accreditation | $120,000+ | Counties in metropolitan areas ge 1 million pop |
| 82 | White | Male | 2019 | Not PRCDA | Stomach | C16.9-Stomach, NOS | 169 | L | L | L | Positive histology | GIST: Gastric and Omental | T3 | N0 | M0 | 1B | 30 | 1 to 3 regional lymph nodes removed | None; diagnosed at autopsy | No radiation and/or no surgery; unknown if surgery and/or radiation given | Surgery performed | None/Unknown | No/Unknown | No systemic therapy and/or surgical procedures | 028 | 100 | 000 | 00 | 053 | 053 | 03 | 00 | No | No | No | No | None; no lymph node metastases | None; no other metastases | Alive | Alive or dead of other cause | Alive or dead due to cancer | 0037 | Complete dates are available and there are more than 0 days of survival | Alive | Alive | Alive | Alive | 2nd of 2 or more primaries | No | Yes | 2 | 2 | 0 | 77 | 2022 | 686799 | Hospital inpatient/outpatient or clinic | Married (including common law) | ANALYTIC abstract from facility WITH CoC accreditation | $120,000+ | Counties in metropolitan areas ge 1 million pop |
| | Race recode (White, Black, Other) | Sex | Year of diagnosis | PRCDA 2020 | Site recode ICD-O-3 2023 Revision Expanded | Primary Site - labeled | Primary Site | Derived Summary Grade 2018 (2018+) | Grade Clinical (2018+) | Grade Pathological (2018+) | Diagnostic Confirmation | AJCC ID (2018+) | Derived EOD 2018 T Recode (2018+) | Derived EOD 2018 N Recode (2018+) | Derived EOD 2018 M Recode (2018+) | Derived EOD 2018 Stage Group Recode (2018+) | RX Summ--Surg Prim Site (1998+) | RX Summ--Scope Reg LN Sur (2003+) | RX Summ--Surg Oth Reg/Dis (2003+) | RX Summ--Surg/Rad Seq | Reason no cancer-directed surgery | Radiation recode | Chemotherapy recode (yes, no/unk) | RX Summ--Systemic/Sur Seq (2007+) | Time from diagnosis to treatment in days recode | EOD Primary Tumor Recode (2018+) | EOD Regional Nodes Recode (2018+) | EOD Mets Recode (2018+) | Tumor Size Over Time Recode (1988+) | Tumor Size Summary (2016+) | Regional nodes examined (1988+) | Regional nodes positive (1988+) | SEER Combined Mets at DX-bone (2010+) | SEER Combined Mets at DX-brain (2010+) | SEER Combined Mets at DX-liver (2010+) | SEER Combined Mets at DX-lung (2010+) | Mets at DX-Distant LN (2016+) | Mets at DX-Other (2016+) | COD to site recode | SEER cause-specific death classification | SEER other cause of death classification | Survival months | Survival months flag | COD to site rec KM | COD to site recode ICD-O-3 2023 Revision | COD to site recode ICD-O-3 2023 Revision Expanded (1999+) | Vital status recode (study cutoff used) | Sequence number | First malignant primary indicator | Primary by international rules | Record number recode | Total number of in situ/malignant tumors for patient | Total number of benign/borderline tumors for patient | Age recode with single ages and 90+ | Year of follow-up recode | Patient ID | Type of Reporting Source | Marital status at diagnosis | CoC Accredited Flag (2018+) | Median household income inflation adj to 2023 | Rural-Urban Continuum Code |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 6091 | Black | Male | 2022 | Not PRCDA | Stomach | C16.6-Greater curvature of stomach NOS | 166 | L | 9 | L | Positive histology | GIST: Gastric and Omental | T3 | N0 | M0 | 1B | 32 | NaN | None; diagnosed at autopsy | No radiation and/or no surgery; unknown if surgery and/or radiation given | Surgery performed | None/Unknown | No/Unknown | No systemic therapy and/or surgical procedures | 000 | 100 | 000 | 00 | 090 | 090 | 00 | 98 | No | No | No | No | None; no lymph node metastases | None; no other metastases | Alive | Alive or dead of other cause | Alive or dead due to cancer | 0005 | Complete dates are available and there are more than 0 days of survival | Alive | Alive | Alive | Alive | One primary only | Yes | Yes | 1 | 1 | 0 | 78 | 2022 | 22442286 | Hospital inpatient/outpatient or clinic | Widowed | ANALYTIC abstract from facility WITH CoC accreditation | $75,000 - $79,999 | Counties in metropolitan areas ge 1 million pop |
| 6092 | Black | Female | 2022 | Not PRCDA | Stomach | C16.9-Stomach, NOS | 169 | L | 9 | L | Positive histology | GIST: Gastric and Omental | T4 | N0 | M0 | 2 | 33 | NaN | None; diagnosed at autopsy | No radiation and/or no surgery; unknown if surgery and/or radiation given | Surgery performed | None/Unknown | Yes | Systemic therapy after surgery | 000 | 100 | 000 | 00 | 103 | 103 | 00 | 98 | No | No | No | No | None; no lymph node metastases | None; no other metastases | Alive | Alive or dead of other cause | Alive or dead due to cancer | 0005 | Complete dates are available and there are more than 0 days of survival | Alive | Alive | Alive | Alive | One primary only | Yes | Yes | 1 | 1 | 0 | 58 | 2022 | 22442404 | Hospital inpatient/outpatient or clinic | Married (including common law) | ANALYTIC abstract from facility WITH CoC accreditation | $75,000 - $79,999 | Counties in metropolitan areas ge 1 million pop |
| 6093 | Black | Female | 2022 | Not PRCDA | Colon And Rectum (Excluding Appendix) | C18.7-Sigmoid colon | 187 | H | 9 | H | Positive histology | GIST: Small Intestinal, Esophageal, Colorectal, Mesenteric, and Peritoneal | T3 | N0 | M0 | 3B | 40 | 4 or more regional lymph nodes removed | None; diagnosed at autopsy | No radiation and/or no surgery; unknown if surgery and/or radiation given | Surgery performed | None/Unknown | Yes | Systemic therapy after surgery | 000 | 100 | 000 | 00 | 072 | 072 | 16 | 00 | No | No | No | No | None; no lymph node metastases | None; no other metastases | Alive | Alive or dead of other cause | Alive or dead due to cancer | 0001 | Complete dates are available and there are more than 0 days of survival | Alive | Alive | Alive | Alive | One primary only | Yes | Yes | 1 | 1 | 0 | 67 | 2022 | 22443599 | Hospital inpatient/outpatient or clinic | Single (never married) | ANALYTIC abstract from facility WITH CoC accreditation | $75,000 - $79,999 | Counties in metropolitan areas ge 1 million pop |
| 6094 | Black | Female | 2022 | Not PRCDA | Stomach | C16.9-Stomach, NOS | 169 | L | 9 | L | Positive histology | GIST: Gastric and Omental | T2 | N0 | M0 | 1A | 30 | NaN | None; diagnosed at autopsy | No radiation and/or no surgery; unknown if surgery and/or radiation given | Surgery performed | None/Unknown | No/Unknown | No systemic therapy and/or surgical procedures | 079 | 100 | 000 | 00 | 038 | 038 | 00 | 98 | No | No | No | No | None; no lymph node metastases | None; no other metastases | Alive | Alive or dead of other cause | Alive or dead due to cancer | 0002 | Complete dates are available and there are more than 0 days of survival | Alive | Alive | Alive | Alive | One primary only | Yes | Yes | 1 | 1 | 0 | 57 | 2022 | 22443657 | Hospital inpatient/outpatient or clinic | Married (including common law) | ANALYTIC abstract from facility WITH CoC accreditation | $80,000 - $84,999 | Counties in metropolitan areas ge 1 million pop |
| 6095 | Black | Male | 2022 | Not PRCDA | Small Intestine | C17.9-Small intestine, NOS | 179 | L | 9 | L | Positive histology | GIST: Small Intestinal, Esophageal, Colorectal, Mesenteric, and Peritoneal | T2 | N0 | M0 | 1 | 30 | NaN | None; diagnosed at autopsy | No radiation and/or no surgery; unknown if surgery and/or radiation given | Surgery performed | None/Unknown | No/Unknown | No systemic therapy and/or surgical procedures | 000 | 100 | 000 | 00 | 033 | 033 | 00 | 98 | No | No | No | No | None; no lymph node metastases | None; no other metastases | Alive | Alive or dead of other cause | Alive or dead due to cancer | 0007 | Complete dates are available and there are more than 0 days of survival | Alive | Alive | Alive | Alive | One primary only | Yes | Yes | 1 | 1 | 0 | 51 | 2022 | 22443676 | Hospital inpatient/outpatient or clinic | Single (never married) | ANALYTIC abstract from facility WITH CoC accreditation | $90,000 - $94,999 | Counties in metropolitan areas ge 1 million pop |
| 6096 | Black | Male | 2022 | Not PRCDA | Small Intestine | C17.9-Small intestine, NOS | 179 | 9 | 9 | 9 | Positive histology | GIST: Small Intestinal, Esophageal, Colorectal, Mesenteric, and Peritoneal | TX | N0 | M0 | 99 | 00 | NaN | None; diagnosed at autopsy | No radiation and/or no surgery; unknown if surgery and/or radiation given | Not recommended | None/Unknown | No/Unknown | No systemic therapy and/or surgical procedures | Unable to calculate | 100 | 000 | 00 | Unknown or size unreasonable (includes any tumor sizes 401-989) | 999 | 00 | 98 | No | No | No | No | None; no lymph node metastases | None; no other metastases | Alive | Alive or dead of other cause | Alive or dead due to cancer | 0000 | Complete dates are available and there are more than 0 days of survival | Alive | Alive | Alive | Alive | One primary only | Yes | Yes | 1 | 1 | 0 | 36 | 2022 | 22443682 | Hospital inpatient/outpatient or clinic | Single (never married) | ANALYTIC abstract from facility WITH CoC accreditation | $90,000 - $94,999 | Counties in metropolitan areas ge 1 million pop |
| 6097 | Black | Male | 2022 | Not PRCDA | Stomach | C16.2-Body of stomach | 162 | 9 | 9 | 9 | Positive histology | GIST: Gastric and Omental | T4 | N0 | M0 | 99 | 00 | NaN | None; diagnosed at autopsy | No radiation and/or no surgery; unknown if surgery and/or radiation given | Not recommended | None/Unknown | Yes | No systemic therapy and/or surgical procedures | 008 | 100 | 000 | 00 | Unknown or size unreasonable (includes any tumor sizes 401-989) | 210 | 00 | 98 | No | No | No | No | None; no lymph node metastases | None; no other metastases | Alive | Alive or dead of other cause | Alive or dead due to cancer | 0003 | Complete dates are available and there are more than 0 days of survival | Alive | Alive | Alive | Alive | One primary only | Yes | Yes | 1 | 1 | 0 | 60 | 2022 | 22443698 | Hospital inpatient/outpatient or clinic | Married (including common law) | ANALYTIC abstract from facility WITH CoC accreditation | $75,000 - $79,999 | Counties in metropolitan areas ge 1 million pop |
| 6098 | Black | Male | 2022 | Not PRCDA | Stomach | C16.3-Gastric antrum | 163 | L | 9 | L | Positive histology | GIST: Gastric and Omental | T4 | N0 | M0 | 2 | 33 | 4 or more regional lymph nodes removed | None; diagnosed at autopsy | No radiation and/or no surgery; unknown if surgery and/or radiation given | Surgery performed | None/Unknown | Yes | Systemic therapy after surgery | 331 | 400 | 000 | 00 | 122 | 122 | 15 | 00 | No | No | No | No | None; no lymph node metastases | None; no other metastases | Alive | Alive or dead of other cause | Alive or dead due to cancer | 0009 | Complete dates are available and there are more than 0 days of survival | Alive | Alive | Alive | Alive | One primary only | Yes | Yes | 1 | 1 | 0 | 64 | 2022 | 22443797 | Hospital inpatient/outpatient or clinic | Married (including common law) | ANALYTIC abstract from facility WITH CoC accreditation | $75,000 - $79,999 | Counties in metropolitan areas ge 1 million pop |
| 6103 | Unknown | Male | 2022 | Not PRCDA | Stomach | C16.2-Body of stomach | 162 | L | L | 9 | Unknown | GIST: Gastric and Omental | T1 | N0 | M0 | 1A | 00 | NaN | None; diagnosed at autopsy | No radiation and/or no surgery; unknown if surgery and/or radiation given | Not recommended | None/Unknown | No/Unknown | No systemic therapy and/or surgical procedures | Unable to calculate | 100 | 000 | 00 | 019 | 019 | 00 | 98 | No | No | No | No | None; no lymph node metastases | None; no other metastases | Alive | Alive or dead of other cause | Alive or dead due to cancer | 0004 | Complete dates are available and there are more than 0 days of survival | Alive | Alive | Alive | Alive | One primary only | Yes | Yes | 1 | 1 | 0 | 87 | 2022 | 22445847 | Hospital inpatient/outpatient or clinic | Widowed | ANALYTIC abstract from facility WITH CoC accreditation | $90,000 - $94,999 | Counties in metropolitan areas ge 1 million pop |
| 6104 | Black | Female | 2022 | Not PRCDA | Stomach | C16.6-Greater curvature of stomach NOS | 166 | L | 9 | L | Positive histology | GIST: Gastric and Omental | T2 | N0 | M0 | 1A | 30 | NaN | None; diagnosed at autopsy | No radiation and/or no surgery; unknown if surgery and/or radiation given | Surgery performed | None/Unknown | No/Unknown | No systemic therapy and/or surgical procedures | 106 | 100 | 000 | 00 | 041 | 041 | 00 | 98 | No | No | No | No | None; no lymph node metastases | None; no other metastases | Alive | Alive or dead of other cause | Alive or dead due to cancer | 0004 | Complete dates are available and there are more than 0 days of survival | Alive | Alive | Alive | Alive | One primary only | Yes | Yes | 1 | 1 | 0 | 58 | 2022 | 22445878 | Hospital inpatient/outpatient or clinic | Divorced | ANALYTIC abstract from facility WITH CoC accreditation | $75,000 - $79,999 | Counties in metropolitan areas ge 1 million pop |